Abstract. We show that for some absolute (explicit) constant $C$, the following
holds for every finitely generated group $G$, and all $d > 0$:
If there is some $R_0 > \exp(\exp(C d^C))$ for which the number of elements in a
ball of radius $R_0$ in a Cayley graph of $G$ is bounded by $R_0^d$, then $G$ has a finite
index subgroup which is nilpotent (of step $< C^d$). An effective bound on the
finite index is provided if "nilpotent" is replaced by "polycyclic", thus yielding
a non-trivial result for finite groups as well.



1. Introduction 

A famous theorem of Gromov [5] asserts that all finitely generated groups of
polynomial growth (thus, in the notation used below, one has $|B_S(R)| \le R^d$ for
some $d$ and all sufficiently large $R$, where $S$ is a fixed set of generators) are virtually
nilpotent. This was generalized by van den Dries and Wilkie [30] by assuming the
polynomial growth condition $|B_S(R)| \le R^d$ at infinitely many scales, rather than at
all scales. A second proof of this fact, which for the first time removed the dependence
on the involved solution to Hilbert's 5th problem, was given recently by Kleiner
[8], motivated by work of Colding-Minicozzi [1] and using the theory of harmonic
functions. In this paper, we refine Kleiner's work to obtain a further strengthening
of Gromov's theorem that only requires the polynomial growth condition at one
(sufficiently large, yet explicit) scale, and gives quantitative control on the nilpotency
degree. To state the result precisely (as mentioned in the abstract above), we
need some notation.

Definition 1.1 ($(R_0, d)$-growth groups). A finitely generated group is a pair $G =
(G, S)$, where $G$ is a group, and $S \subset G$ is a finite non-empty symmetric set which
generates $G$ (thus $S^{-1} := \{s^{-1} : s \in S\} = S$). For each $g \in G$, we define
$\|g\|_S := \inf\{n : g \in S^n\}$ to be the length of the smallest word with alphabet $S$
that evaluates to $g$. For each $R > 0$, we define $B_S(R) := \{g \in G : \|g\|_S \le R\}$ to
be the collection of words in $S$ of length at most $R$. For $R_0, d > 0$, we define a
$(R_0, d)$-growth group to be a finitely generated group $(G, S)$ such that

(1) $|B_S(R_0)| \le R_0^d$.
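
To make condition (1) concrete, the following minimal sketch computes the ball sizes $|B_S(r)|$ by breadth-first search in the Cayley graph and tests (1) at a single integer scale. The helper names `ball_sizes` and `is_growth_group`, and the choice of $\mathbb{Z}^2$ with its standard generators, are illustrative assumptions and do not appear in the paper.

```python
def ball_sizes(identity, generators, mul, max_radius):
    """Return [|B_S(0)|, ..., |B_S(max_radius)|] via breadth-first search."""
    seen = {identity}
    frontier = [identity]
    sizes = [1]
    for _ in range(max_radius):
        nxt = []
        for g in frontier:
            for s in generators:
                h = mul(g, s)
                if h not in seen:
                    seen.add(h)
                    nxt.append(h)
        frontier = nxt
        sizes.append(len(seen))
    return sizes

def is_growth_group(sizes, R0, d):
    """Check condition (1), |B_S(R0)| <= R0**d, at the integer radius R0."""
    return sizes[R0] <= R0 ** d

# Hypothetical illustration: Z^2 with the symmetric set S = {(+-1,0), (0,+-1)}.
add = lambda g, s: (g[0] + s[0], g[1] + s[1])
S = [(1, 0), (-1, 0), (0, 1), (0, -1)]
sizes = ball_sizes((0, 0), S, add, 20)
```

In this example $|B_S(r)| = 2r^2 + 2r + 1$, so at the scale $R_0 = 20$ the pair $(\mathbb{Z}^2, S)$ satisfies (1) with $d = 3$ but not with $d = 2$; this illustrates that the admissible $d$ depends on the chosen scale.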

Remark 1.2. One can of course generalize (1) by replacing the right-hand side
$R_0^d$ by $C R_0^d$ for some additional parameter $C$, as is customary in the literature.
But this would add a new parameter to an already complicated notational system
and so we have chosen to drop this parameter, as one can partially simulate it by
increasing $d$ slightly and assuming $R_0$ is large. Note that a finitely generated group
$(G, S)$ has polynomial growth if and only if there exists a $d$ such that $(G, S)$ is a
$(R_0, d)$-growth group for all sufficiently large $R_0$.
Definition 1.3 (Quantitative finite index). Let $R, K > 0$. A finitely generated
group $(G', S')$ is said to be a $(K, R)$-subgroup of $(G, S)$ if $G'$ is a subgroup of $G$,
$S' \subset B_S(R)$, and $B_S(K+1) \subset B_S(K) \cdot B_{S'}(K)$. If we drop the final condition
$B_S(K+1) \subset B_S(K) \cdot B_{S'}(K)$, we say that $G'$ is a $(\infty, R)$-subgroup of $G$.

Remark 1.4. Observe that if $(G', S')$ is a $(K, R)$-subgroup of $(G, S)$ then $B_S(r+1) \subset
B_S(r) \cdot B_{S'}(K)$ for all $r \ge K$, and on iterating this we conclude $G \subset B_S(K) \cdot G'$. In
particular, $G'$ is a finite index subgroup of $G$ of index at most $|B_S(K)|$. Conversely,
if $(G', S')$ has finite index in $(G, S)$, then we can write $G$ as a finite union of cosets
$x_1 \cdot G', \dots, x_m \cdot G'$ of $G'$, and in particular one has the relations

$e x_i = x_{j_{e,i}} g_{e,i}$

for all $1 \le i \le m$, $e \in S$, and some $1 \le j_{e,i} \le m$, $g_{e,i} \in G'$. If one then sets
$R := \sup\{\|e'\|_S : e' \in S'\}$ and $K := \sup(\{\|x_i\|_S : 1 \le i \le m\} \cup \{\|g_{e,i}\|_{S'} : 1 \le
i \le m, e \in S\})$ we see that $(G', S')$ is a $(K, R)$-subgroup. Note, however, that this
argument does not (and cannot) give effective bounds on $K, R$.

Example 1.5. Let $m = 2p - 1$ be a large odd number. Then the (additive) finitely
generated group $(\mathbb{Z}/m\mathbb{Z}, \{-2, +2\})$ is a $(2, 2)$-subgroup of $(\mathbb{Z}/m\mathbb{Z}, \{-1, +1\})$, while
conversely, $(\mathbb{Z}/m\mathbb{Z}, \{-1, +1\})$ is merely a $(2, p)$-subgroup of $(\mathbb{Z}/m\mathbb{Z}, \{-2, +2\})$.
The intuition here is that while $\{-2, +2\}$ does generate the element 1, this is a
"global" fact (relying on the parity of $m$) rather than a "local" one, and thus cannot
be detected in the limit $p \to \infty$ if one is only allowed to perform a bounded
number of group operations. Thus we see that this quantitative notion of finite
index not only measures the index of $G'$ in $G$, but also the relative position of $S$
and $S'$. This kind of "practical" interpretation of abstract (and often trivial) group
theoretic notions is necessary in order to perform the quantitative arguments in
this paper properly.
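
The two claims of Example 1.5 can be checked by brute force for a specific modulus. The sketch below is only an illustration under stated assumptions: the function names (`word_norms`, `ball`, `is_KR_subgroup`) are hypothetical, and the test is restricted to cyclic groups $\mathbb{Z}/m\mathbb{Z}$ written additively.

```python
def word_norms(m, gens):
    """Word lengths ||x||_S for every x in the subgroup of Z/mZ generated by gens (BFS)."""
    dist = {0: 0}
    frontier = [0]
    while frontier:
        nxt = []
        for x in frontier:
            for s in gens:
                y = (x + s) % m
                if y not in dist:
                    dist[y] = dist[x] + 1
                    nxt.append(y)
        frontier = nxt
    return dist

def ball(dist, r):
    """The ball of radius r for the word metric recorded in dist."""
    return {x for x, n in dist.items() if n <= r}

def is_KR_subgroup(m, S, S_prime, K, R):
    """Check S' in B_S(R) and B_S(K+1) in B_S(K) + B_{S'}(K), inside Z/mZ."""
    dS, dSp = word_norms(m, S), word_norms(m, S_prime)
    if not all(dS.get(s % m, float("inf")) <= R for s in S_prime):
        return False
    sumset = {(a + b) % m for a in ball(dS, K) for b in ball(dSp, K)}
    return ball(dS, K + 1) <= sumset

p = 50
m = 2 * p - 1          # m = 99
norm_of_one = word_norms(m, [2, -2])[1]   # length of 1 as a word in {-2, +2}
```

For $p = 50$ the element 1 has word length $p - 1 = 49$ with respect to $\{-2, +2\}$ (since $-2 \cdot 49 \equiv 1$ modulo 99), so the second parameter in the converse direction must grow linearly in $p$, in line with the "global versus local" intuition described in the example.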

Remark 1.6. It is immediate that if $(G, S)$ is a $(R_0, d)$-growth group, and $(G', S')$ is
a $(K, R_0^\kappa)$-subgroup of $(G, S)$ for some $0 < \kappa < 1$, then $(G', S')$ is a
$(R_0^{1-\kappa}, d/(1-\kappa))$-growth group. This is analogous to the obvious fact that any finite
index subgroup of a group of polynomial growth remains of polynomial growth.

Definition 1.7 (Virtual nilpotency). Let $K, R, s, D \ge 1$. A finitely generated
group $(G, S)$ is said to be $(s, D)$-nilpotent if $|S| \le D$ and $G$ is nilpotent of step
at most $s$ (i.e. every $s'$-fold iterated commutator vanishes for $s' > s$). A group
$(G, S)$ is said to be $(K, R, s, D)$-virtually nilpotent if it contains a $(s, D)$-nilpotent
$(K, R)$-subgroup $(G', S')$.

Theorem 1.8 (Quantitative Gromov theorem). Let $d, R_0 > 0$, and assume that

$R_0 > \exp(\exp(C d^C))$

for some sufficiently large absolute constant $C$. Then every $(R_0, d)$-growth group is
$(K(R_0,d), K(R_0,d), C^d, C^d)$-virtually nilpotent for some $K(R_0,d)$ depending only
on $R_0, d$.

In order to simplify the exposition somewhat, we do not give an effective bound
for $K(R_0,d)$ in our arguments, relying instead on an ineffective compactness argument
to establish its finiteness. However, it is possible to eradicate this compactness
argument from the proof by standard "quantifier elimination" techniques, at the
cost of making it substantially lengthier (and the final bound for $K(R_0, d)$ obtained
is quite poor, of Ackermann type in $d$). On the other hand, the arguments do give
an effective value for $C$ (it appears that $C = 100$ works, even if for clarity we
won't keep track of this aspect, in which no tightness is claimed). We discuss full
effectivization issues in Section 15, along with the following result:

Theorem 1.9 (Fully quantitative weak Gromov theorem). Let $d > 0$ and $R_0 > 0$,
and assume that

$R_0 > \exp(\exp(C d^C))$

for some sufficiently large absolute constant $C$. Then every $(R_0, d)$-growth group
has a normal subgroup of index at most $\exp(R_0^{\exp(\exp(d^C))})$ which is polycyclic.

This result is in fact established along Sections 6-10 (see Proposition 5.2 below),
and can be recommended to the reader as a natural "resting point" along the way
to the full proof of the main Theorem 1.8. It captures the quantitative outcome
of Kleiner's approach, and avoids completely the Milnor-Wolf part of
the proof, which is only made semi-quantitative here. Note also that a completely
effective version of Theorem 1.8 is still available when the group is assumed torsion
free; see Corollary 15.6.

We have the following immediate corollary of Theorem 1.8:

Corollary 1.10 (Slightly super-polynomial growth implies virtual nilpotency). Let
$(G, S)$ be a finitely generated group such that

$|B_S(R)| \le R^{c(\log\log R)^c}$

for some $R > 1/c$, where $c > 0$ is a sufficiently small absolute constant. Then $G$ is
virtually nilpotent.

Remark 1.11. By using completely ineffective compactness arguments, it has been
well known that Gromov's theorem implies the existence of some super-polynomial
growth function for which the Corollary holds, although no such explicit function
was known before. It has been proposed that the subradical growth type $\exp(\sqrt{n})$
might work (see [3], [14] for some results in this direction). It is also interesting to
compare the situation with Segal's refutation [25] of a conjecture of Lubotzky, Pyber
and Shalev [16] concerning the growth type of the number of finite index subgroups
of a residually finite group (as a function of the index). It has been shown in [25]
that here no super-polynomial bound could yield the same characterization as the
polynomial one, of being virtually solvable of finite rank (a remarkable result of
Lubotzky, Mann and Segal [15], which relies, among other things, on Lazard's deep
p-adic analogue of Hilbert's 5th problem [11]).

The nilpotent group produced by Theorem 1.8 has step at most $C^d$, but an
inspection of the proof shows that it also has Hirsch length (i.e., sum of torsion
free ranks of the abelian quotients in a grading) at most $C^d$. It is a standard
computation (which also follows from the well known Bass-Guivarc'h formula for
the growth of nilpotent groups) that nilpotent groups of Hirsch length $r$ and step
$s$ have polynomial growth of order at most $O(rs)$. We thus conclude:

Corollary 1.12 (Polynomial growth at one scale implies polynomial growth at all
scales). Let $d > 0$ and $R_0 > 0$, and assume that

$R_0 > \exp(\exp(C d^C))$

for some sufficiently large absolute constant $C$. Let $G$ be a $(R_0, d)$-growth group.
Then one has

$|B_S(R)| \le K R^{C^d}$

for all $R \ge 1$, where $K = K(R_0, d)$ depends only on $R_0, d$.

Thus, polynomial growth of order $d$ at one (moderately large) scale implies
polynomial growth of order $O(1)^d$ at all subsequent scales. We do not know if this
exponential loss in the polynomial growth degree is necessary.

1.13. Overview of proof. Our arguments broadly follow the strategy used by
Kleiner [8] to prove Gromov's theorem, being based in particular on the study of
harmonic functions on the Cayley graph associated to $(G, S)$. Kleiner's proof
proceeded roughly along the following steps:

(i) If G is infinite amenable (or is merely without property (T)), then it admits 
a fixed point free affine action on a Hilbert space, one of its orbit maps 
being a Hilbert space valued non-constant Lipschitz harmonic function on 
G. After taking a section, this implies the existence of a scalar non-constant 
Lipschitz harmonic function. 

(ii) If G has polynomial growth, then the space of real valued Lipschitz (or 
fixed polynomial growth) harmonic functions on G is finite-dimensional. 

(iii) From (i) and (ii), an infinite group G of polynomial growth admits a finite- 
dimensional representation with infinite image, or equivalently, a normal 
subgroup H whose quotient G / H is an infinite linear group. 

(iv) Any linear group of polynomial growth is virtually solvable. 

(v) From (iii) and (iv), an infinite group G of polynomial growth admits a
normal subgroup H with virtually solvable (or better yet, virtually infinite
cyclic) quotient G/H.

(vi) The group H obtained in (v) as the kernel of a Z-target homomorphism of 
a finite index subgroup of G has slower growth, and thus (by induction) is 
virtually nilpotent. Hence G is virtually solvable (in fact, polycyclic). 

(vii) Any virtually polycyclic (or merely solvable) group of polynomial growth 
is virtually nilpotent. 

The idea is then to make the proofs of the statements (i)-(vii) as elementary as
possible, so that they may be made quantitative. Note that for these purposes,
statements involving infinite objects must be re-proved in a finitary version, and
in the fully effective results (e.g. Theorem 1.9) only finitely many elements of the
group are actually involved. This is a rather non-conventional difficulty in geometric
group theory, where Gromov's theorem is a cornerstone. We next remark on our
modification of these steps.

Statement (i) is made quantitative in Section 6. The proof given in [8] uses
ultralimits and considerations related to Kazhdan's Property (T), and is thus of a
qualitative nature. However, it turns out that one can use the spectral theory of the
Laplacian and Young's inequality to obtain a quantitative version of an existence
theorem for non-constant Lipschitz harmonic functions on all infinite groups.

Statement (ii), which we discuss in Section 7, is the heart of Kleiner's approach
(inspired by the related work of Colding-Minicozzi [1]). His arguments are already
quite quantitative. Two noticeable (related) difficulties that arise when trying to
quantify Kleiner's proof are that the scale $R$ for which a finite dimensional space
of harmonic functions injects into the ball $B(e, R)$ is not effective, and that at this
scale one lacks an a priori lower bound on the positive value of the determinant of an
associated positive definite quadratic form.
For statement (iii), unlike the abstract setting of Kleiner's proof, the space we
work with is the one appearing in Statement (ii) (modulo the constants), on which
the group operates naturally (preserving the Lipschitz norm). This is trivial in
the qualitative world, but does require a certain amount of care in the quantitative
setting (for instance, one needs quantitative versions of the assertion that any
finite-dimensional vector space has a basis), and is done in Section 8.

Statement (iv), established in Section 9, was proven in [8] (as in Gromov's original
[5]) using the Tits alternative, or by a weaker variant of that alternative due to
the first author [23]. However, for our purposes we provide a completely different,
elementary proof, which makes crucial use of the fact (not exploited previously)
that the linear representation in (iii) ranges in a compact Lie group (the one
preserving the Lipschitz norm in (iii)). Our arguments here are inspired by the famous
Solovay-Kitaev theorem [26] in the theory of quantum computations, which itself
may be viewed as a variant on the well known Zassenhaus-Kazhdan-Margulis
Theorem (cf. [7, Section 4.12]). As in the aforementioned results, we obtain a
quantitative version of (iv) based on the fact that commutators of matrices near the
identity collapse fast towards it, and hence cannot be non-trivially accommodated
without a "rapidly growing supply" of group elements, which will be incompatible
with the polynomial growth hypotheses. Furthermore, the compactness of the
ambient group ensures the existence of a finite index subgroup of matrices that are
close enough to the identity that the previous considerations apply.

Statement (v) is formalized in Proposition 5.2 and follows simply by putting all
the previous statements together.

Statements (vi), (vii) are formalized together as Proposition 5.3. Statement (vii)
is a result of Milnor [20] and Wolf [32]. The arguments of Milnor quickly allow us
to reduce to a polycyclic setting in which $G$ is an extension of a cyclic group by
a virtually nilpotent group. The main task, as in [32], is to then show that outer
automorphisms of virtually nilpotent groups which are of polynomial growth are
necessarily virtually unipotent. To do this, one must first eliminate the torsion from
the nilpotent group, and then after taking quotients one is faced with understanding
linear transformations of polynomial growth on a free abelian group $\mathbb{Z}^d$. But this
can be handled using known results on Mahler measure of algebraic integers. It
turns out that all these steps can be made semi-quantitative (in the sense that one
does not control the index behind the modifier "virtually"), and this is done in
Sections 11-14; this then gives a quantitative (but ineffective) proof of Theorem
1.8 by a compactness argument given in Section 3. In Section 15 we discuss the
changes needed to make the semi-quantitative proof fully effective.

There are quite a few applications of Gromov's theorem where an effective version
of it is of interest. We illustrate this only briefly in one geometric setting which
was discussed in Gromov's original paper [5], as Corollary 15.2 below.

Finally, as a by-product of our effort one obtains a simplified, particularly
accessible proof of Gromov's original qualitative theorem. It also avoids the Tits
alternative and any use of p-adic numbers, and can be fully digested using basic
background in linear algebra and calculus. This soft proof is presented in our
companion paper [24], along with various other results and questions related to the
current work and around the theme of Lipschitz harmonic functions on groups.
1.14. Comparison with other work. In the literature there seems to be only
one previous effective result related to Gromov's theorem, due to van den Dries
and Wilkie [21], which handles the subquadratic growth case $d < 2$ by using clever,
elementary, and quite combinatorial arguments. (The case $d < 1$ is trivial; see
Lemma 5.1.)

A related quantitative formulation of Kleiner's argument was recently given by
Lee and Makarychev [17]. They worked in the setting of a finite group $(G, S)$ obeying
a doubling condition $|B_S(2R)| \le 2^d |B_S(R)|$ at all scales $R > 0$, as opposed
to the (weaker) assertion of being an $(R_0, d)$-growth group at a single scale. Using
Kleiner's method, they obtained upper bounds (roughly of the order of $\exp(O(d^2))$)
on the multiplicity of eigenvalues of the Laplacian, and also obtained a subgroup of
$G$ of index bounded by $\exp(\exp(O(d^2)))$ which had a homomorphic image onto a
cyclic group $\mathbb{Z}/M\mathbb{Z}$ of cardinality at least $M \ge \exp(-O(d)) |G|^{\exp(-O(d^2))}$. In the
infinitary setting, the presence of a bounded index subgroup with a large homomorphic
image can be used to locate a commutator subgroup with a reduced order
of growth, to which one can apply an induction hypothesis to obtain a Gromov-like
theorem. Unfortunately, a technical obstruction in this finitary case is that
the doubling condition in [17] is required at all scales (up to the diameter of $G$),
whereas the homomorphic image only yields a growth reduction up to scale $M$ or
so. Our main Theorem 1.8 can be viewed as an answer to a question raised in
[17], regarding whether Kleiner's methods can be adapted to the finitary setting
assuming a polynomial growth hypothesis rather than a doubling condition.

Another related result, of Milnor-Wolf type, was obtained recently by the second
author in [28]. There, it was shown that if $G$ is a $(R_0, d)$-growth group which is
solvable of derived length at most $l$, with $R_0$ sufficiently large depending on $l, d$,
then $G$ contains a nilpotent group of step at most $s(l, d)$ and index at most $R_0^{C(l,d)}$
for some $s(l,d), C(l,d)$ depending only on $l$ and $d$. This result has a much better
control on the index of the nilpotent subgroup than Theorem 1.8, but it gives a
much poorer (though still effective) bound on the step, or on the size of $R_0$ required,
and is, in addition, restricted to the solvable case. The methods used in that paper
are quite different from those here, relying on additive combinatorics rather than
the theory of harmonic functions and linear representations. By combining the
results in [28] with Theorem 15.3, one can obtain a variant of Theorem 1.8 in which
the bound $K(R_0,d)$ is completely effective, but the quantities $\exp(\exp(C d^C))$ and
$C^d$ appearing in that theorem are replaced by much larger (but still effective and
explicit) functions of $d$. We omit the details.

1.15. Acknowledgments. The authors thank Emmanuel Breuillard for valuable
discussions, and the anonymous referee for corrections. The first author was
supported by ISF and NSF grants number 500/05 and DMS-0701639 resp. The second
author is supported by a grant from the MacArthur Foundation, by NSF grant
DMS-0649473, and by the NSF Waterman award.

2. Notation 

If $E, F$ are two subsets of a multiplicative group $G$, we use $E \cdot F$ to denote the
product set $\{ef : e \in E, f \in F\}$, and $E^{-1}$ to denote the inverse set $\{e^{-1} : e \in E\}$.
In an additive group we can similarly define the sum set $E + F$ and reflection
$-E$. We also define dilates $k \cdot E := \{ke : e \in E\}$ for $E$ in an additive group and
non-negative integers $k$.

Given a group $G$, we define the commutator $[g, h]$ of two group elements $g, h \in G$
by $[g, h] := g h g^{-1} h^{-1}$, and the commutator of two subgroups $[H, K]$ to be the
group generated by $\{[h, k] : h \in H, k \in K\}$. We define the lower central series
$G = G_1 \ge G_2 \ge \dots$ by $G_1 := G$ and $G_{i+1} := [G_i, G]$, and the derived series
$G = G^{(1)} \ge G^{(2)} \ge \dots$ by $G^{(1)} := G$ and $G^{(i+1)} := [G^{(i)}, G^{(i)}]$. We say that $G$ is
nilpotent of step at most $s$ if $G_{s+1}$ is trivial, and solvable of derived length at most
$l$ if $G^{(l+1)}$ is trivial.
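
As a small worked instance of these definitions, the sketch below computes the lower central series and the derived series of the Heisenberg group over $\mathbb{Z}/3\mathbb{Z}$ by brute force. The encoding of a unitriangular matrix by a triple $(a, b, c)$ and all function names are assumptions made for this illustration only.

```python
from itertools import product

P = 3  # Heisenberg group over Z/3Z; (a, b, c) stands for [[1,a,c],[0,1,b],[0,0,1]]

def mul(g, h):
    a, b, c = g
    x, y, z = h
    return ((a + x) % P, (b + y) % P, (c + z + a * y) % P)

def inv(g):
    a, b, c = g
    return ((-a) % P, (-b) % P, (-c + a * b) % P)

def comm(g, h):
    """[g, h] := g h g^{-1} h^{-1}."""
    return mul(mul(g, h), mul(inv(g), inv(h)))

def generated(elements):
    """Subgroup generated by a finite set (closure under products suffices in a finite group)."""
    group = {(0, 0, 0)} | set(elements)
    frontier = list(group)
    while frontier:
        new = []
        for g in frontier:
            for h in list(group):
                for k in (mul(g, h), mul(h, g)):
                    if k not in group:
                        group.add(k)
                        new.append(k)
        frontier = new
    return group

def commutator_subgroup(H, K):
    """[H, K]: the group generated by all [h, k] with h in H, k in K."""
    return generated({comm(h, k) for h in H for k in K})

def series(G, next_term):
    """Iterate H -> next_term(H) from G until the trivial group or a repeat is reached."""
    terms = [G]
    while len(terms[-1]) > 1:
        nxt = next_term(terms[-1])
        if nxt == terms[-1]:   # the series has stabilized at a non-trivial group
            break
        terms.append(nxt)
    return terms

G = set(product(range(P), repeat=3))
lower_central = series(G, lambda H: commutator_subgroup(H, G))  # G_{i+1} = [G_i, G]
derived = series(G, lambda H: commutator_subgroup(H, H))        # G^{(i+1)} = [G^{(i)}, G^{(i)}]
```

Here $G_1$ has 27 elements, $G_2$ is the centre (3 elements) and $G_3$ is trivial, so this group is nilpotent of step at most 2 and solvable of derived length at most 2 in the above conventions.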

We write $X = O(Y)$, $X \ll Y$ or $Y \gg X$ to denote the statement that $|X| \le CY$
for some absolute constant $C$.

3. A compactness reduction

In this section we perform a compactness reduction to eliminate the role of
the quantities $K(R_0,d)$ appearing in Theorem 1.8. This will simplify the proof
substantially, at the cost of rendering the final value of $K(R_0, d)$ obtained ineffective;
but see Section 15 for how one could avoid the use of compactness to obtain an
effective value of $K(R_0, d)$.

We first remove K and R from the definition of virtual nilpotency. 

Definition 3.1 (Virtual nilpotency, again). Let s, D > 1. A finitely generated 
group (G,S) is said to be virtually (s, D) -nilpotent if it contains a finite index 
subgroup that is (s, D)-nilpotent. 

We now claim that Theorem 1.8 follows from the following "semi-quantitative"
variant:

Theorem 3.2 (Semi-quantitative Gromov theorem). Let $d > 0$ and $R_0 > 0$, and
assume that

$R_0 > \exp(\exp(C d^C))$

for some sufficiently large absolute constant $C$. Then every $(R_0, d)$-growth group is
virtually $(C^d, C^d)$-nilpotent.

We now give the (standard) compactness argument that lets us deduce Theorem
1.8 from Theorem 3.2 (such an argument appeared already in Gromov's [5], while the
general formalism of the "space of marked finitely generated groups" underlying it
was introduced later by Grigorchuk in [4]).

Proof of Theorem 1.8 assuming Theorem 3.2. Let $C$ be the absolute constant in
Theorem 3.2. Suppose Theorem 1.8 failed; then we could find $R_0, d$ obeying the
specified bound (for this choice of $C$), and a sequence $(G_N, S_N)$ of $(R_0, d)$-growth
groups with $N \to \infty$ such that $(G_N, S_N)$ is not $(N, N, C^d, C^d)$-virtually nilpotent.
Observe that $|S_N| \le |B_{S_N}(R_0)| \le R_0^d$ is uniformly bounded, hence by passing to
a subsequence we may assume $|S_N| = k$ for all $N$ and some fixed $k$. Identify now
each $(G_N, S_N)$ as a quotient of the free group $(F_k, S)$ under the homomorphism
mapping the free generators $S$ to $S_N$, and denote the kernel by $M_N \lhd F_k$. By a
diagonalization process and after passing to a subsequence, we may assume that
the sets $M_N$ converge, i.e. they eventually agree on every finite subset of $F_k$. Their
limit, $M$, is of course a normal subgroup of $F_k$. Obviously $G = F_k/M$ with the
projection of $S$ is a $(R_0, d)$-growth group, hence by Theorem 3.2 it has a finite index
$(C^d, C^d)$-nilpotent subgroup. By Remark 1.4, $(G, S)$ is $(K, K, C^d, C^d)$-virtually
nilpotent for some $K$. To complete the argument it only remains to observe the
following three facts:

(1) The group G, being virtually nilpotent, is finitely presented; 

(2) If a sequence of marked groups converges to a finitely presented group as 
above, then from some point on they are all quotients of it (indeed, this 
happens when the finitely many relations of the limit group stabilize in the 
sequence); and 

(3) The property of being a $(K, K, C^d, C^d)$-virtually nilpotent group is inherited by
quotients.

□ 

For technical reasons it is convenient to modify the above definition slightly, by 
replacing the number D of generators and the step s with the Hirsch length of 
the nilpotent group. Recall that the Hirsch length of a nilpotent group (or more 
generally, a polycyclic group) is the sum of the torsion-free ranks of the quotients in 
any normal series of that group with abelian quotients (e.g. for a nilpotent group, 
one could use the lower central series) . 

Definition 3.3 (Virtual nilpotency, yet again). Let r > 1. A r-nilpotent group is 
a finitely generated nilpotent group of Hirsch length at most r (and thus step at 
most r, as well). A finitely generated group (G, S) is said to be virtually r-nilpotent 
if it contains a finite index subgroup that is r-nilpotent. 

By a result of Malcev [18], if $G$ is nilpotent then any set $S \subset G$ whose projection
generates the abelianization $G/[G, G]$, generates all of $G$. Hence, an $r$-nilpotent
group contains a finite index subgroup generated by at most $r$ generators (or $2r$, if
one enforces symmetry). Thus one may replace the conclusion of Theorem 3.2 by
the assertion that every $(R_0, d)$-growth group is virtually $C^d$-nilpotent.

It remains to prove Theorem 3.2 with this modification. This is the purpose of
the remaining sections of the paper.

4. Generator bounds 

In this section we collect a number of useful results which exploit polynomial 
growth hypotheses to locate bounded sets of generators for various types of groups; 
such results will be used frequently in the sequel. 

First, we observe that the property of having quantitative finite index is transi- 
tive. 

Lemma 4.1 (Transitivity of quantitative finite index). If $(G', S')$ is a $(K, R)$-subgroup
of $(G, S)$, and $(G'', S'')$ is a $(K', R')$-subgroup of $(G', S')$, then $(G'', S'')$
is a $(KK'(K + RK' + 1), RR')$-subgroup of $(G, S)$.

Proof. Clearly $S'' \subset B_{S'}(R') \subset B_S(RR')$. Next, since

$B_S(K + 1) \subset B_S(K) \cdot B_{S'}(K)$

we have

$B_S(n) \subset B_S(K + n) \subset B_S(K) \cdot B_{S'}(nK)$

for any $n \ge 1$. Similarly

$B_{S'}(n') \subset B_{S'}(K' + n') \subset B_{S'}(K') \cdot B_{S''}(n'K')$

for all $n' \ge 1$. Combining the two, we conclude that

$B_S(n) \subset B_S(K) \cdot B_{S'}(K') \cdot B_{S''}(nKK')$

for all $n \ge 1$. Since $B_{S'}(K') \subset B_S(RK')$, we conclude that

$B_S(n) \subset B_S(K + RK') \cdot B_{S''}(nKK')$.

Setting $n := K + RK' + 1$, we conclude that

$B_S(K + RK' + 1) \subset B_S(K + RK') \cdot B_{S''}(KK'(K + RK' + 1))$,

and thus

$B_S(KK'(K + RK' + 1) + 1) \subset B_S(KK'(K + RK' + 1)) \cdot B_{S''}(KK'(K + RK' + 1))$

and the claim follows. □

Next, we observe that a $(R_0, d)$-growth group has a bounded number of generators,
after passing to a finite index subgroup.

Lemma 4.2 (Generator reduction of $(R_0, d)$-growth groups). Let $d \ge 1$, $0 < \kappa < 1$,
and $R_0 \ge 100^{1/\kappa}$, and let $(G, S)$ be a $(R_0, d)$-growth group. Then there exists a
$(R_0^\kappa, R_0^\kappa)$-subgroup $(G', S')$ of $G$ with $|S'| \le O(1)^{d/\kappa}$. In particular (by Remark 1.6)
$(G', S')$ is a $(R_0^{1-\kappa}, d/(1-\kappa))$-growth group. Furthermore we have $G = G' \cdot B_S(R_0^\kappa)$.

Remark 4.3. In practice, we will take $\kappa$ to be small compared to $d$ (e.g. $\kappa = 1/100d$),
and so the slight degradation $d \mapsto d/(1 - \kappa)$ in the order of growth here will be
acceptable (at a later stage of the argument, we will obtain a reduction in the
growth order by 1, which will more than compensate for these sorts of losses).

Proof. Clearly

$1 \le |B_S(1)| \le |B_S(R_0^\kappa)| \le |B_S(R_0)| \le R_0^d$.

By the pigeonhole principle, one can thus find a radius $1 \le r \le R_0^\kappa/10$ such that

$|B_S(10r)| \le O(1)^{d/\kappa} |B_S(r)|$.

Fix this $r$. Now, let $X$ be a maximal subset of $B_S(4r)$ such that the sets $x \cdot B_S(r)$
for $x \in X$ are disjoint. Since the $x \cdot B_S(r)$ are contained in $B_S(5r)$, we have

$|X| \le |B_S(5r)|/|B_S(r)| \le O(1)^{d/\kappa}$.

On the other hand, by construction of $X$ we have the covering property

$B_S(4r) \subset X \cdot B_S(2r)$.

In particular, if we set $S' := X \cup X^{-1}$, and let $G'$ be the group generated by $S'$,
then $|S'| \le O(1)^{d/\kappa}$ and

(2) $B_S(4r) \subset S' \cdot B_S(2r)$;

iterating this, we obtain

$B_S(nr) \subset B_{S'}(n) \cdot B_S(2r)$

for $n = 1, 2, \dots$. In particular, we have

$B_S(R_0^\kappa + 1) \subset B_{S'}(R_0^\kappa) \cdot B_S(R_0^\kappa)$

which on inversion gives

$B_S(R_0^\kappa + 1) \subset B_S(R_0^\kappa) \cdot B_{S'}(R_0^\kappa)$.

On the other hand, we have $S' \subset B_S(4r) \subset B_S(R_0^\kappa)$, and the claim follows. □
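
The packing and covering step in the proof above can be illustrated on a concrete group. The following sketch greedily builds a maximal set $X \subset B_S(4r)$ with disjoint translates $x \cdot B_S(r)$ in $\mathbb{Z}^2$ (written additively, with the standard generators, so that balls are $\ell^1$ balls); the group, the greedy order, and the function names are assumptions chosen for illustration and are not part of the paper's argument.

```python
def l1_ball(r):
    """B_S(r) in Z^2 for S = {(+-1,0), (0,+-1)}: the l^1 ball of radius r."""
    return {(x, y) for x in range(-r, r + 1) for y in range(-r, r + 1)
            if abs(x) + abs(y) <= r}

def translate(x, B):
    """The translate x + B."""
    return {(x[0] + b[0], x[1] + b[1]) for b in B}

def maximal_packing(r):
    """Greedily pick X in B_S(4r) so that the translates x + B_S(r) are pairwise disjoint.

    Scanning every candidate once yields a maximal such X, because a rejected
    candidate already meets a chosen translate and chosen translates are never removed.
    """
    Br = l1_ball(r)
    X, covered = [], set()
    for x in sorted(l1_ball(4 * r)):
        t = translate(x, Br)
        if covered.isdisjoint(t):
            X.append(x)
            covered |= t
    return X

r = 2
X = maximal_packing(r)
S_prime = set(X) | {(-x, -y) for (x, y) in X}   # S' := X union X^{-1}
```

By disjointness the translates pack into $B_S(5r)$, giving $|X| \le |B_S(5r)|/|B_S(r)|$, and by maximality every point of $B_S(4r)$ lies in some $x + B_S(2r)$; both properties can be confirmed directly on the output.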
Remark 4.4. Because of this lemma, we will be able to safely absorb a number of
terms involving the size $|S|$ of the generating set in the arguments that follow. (One
should think of $d$ and $1/\kappa$ as being bounded; the key point is that the bound on
$|S'|$ is independent of $R_0$.)

Remark 4.5. If one does not pass to a finite index subgroup, then one may need as
many as $\log R_0$ generators. Indeed, consider the abelian group $G = \mathbb{Z}_2^n \times \mathbb{Z}$, where
$n \sim \log R_0$. Then $B_S(R_0)$ is of polynomial size in $R_0$, but one needs $n + 1 \sim \log R_0$
generators in order to generate the whole group $G$. In the converse direction, if
$|B_S(R_0)| \le R_0^d < 2^{R_0}$, then if we let $s_1, \dots, s_n$ be a maximal sequence in $S$ which
is dissociated (i.e. the words $s_1^{i_1} \cdots s_n^{i_n}$ for $i_1, \dots, i_n \in \{0, 1\}$ are different), then
one easily verifies that $2^n \le |B_S(R_0)| \le R_0^d < 2^{R_0}$ and thus $n \le d \log_2 R_0 < R_0$,
and that the set $\{s_1, \dots, s_n, s_1^{-1}, \dots, s_n^{-1}\}$ generates $S$ and thus $G$. Thus $G$ can be
generated by $O(d \log R_0)$ generators in this case.

Remark 4.6. In many applications of Theorem 1.8, $|S|$ would already be bounded.
But in our proof of Theorem 1.8 (which is based on an induction on $d$), it will
become necessary at some stage in the proof to pass from $G$ to a subgroup such
as $[G, G]$, which need not have a bounded number of generators. It is then that
Lemma 4.2 becomes necessary.

We shall need two further results in a similar spirit. The first asserts that if a 
$(R_0, d)$-growth group $G$ can be generated by a small number of generators and has
polynomial growth, then so does [G, G] (cf. page 61], or [2"U]): 

Lemma 4.7 (Generator reduction for a commutator group). Let $d \ge 1$, $0 < \kappa < 1$,
and $R_0 \ge C d^{C/\kappa}$ for some sufficiently large absolute constant $C$, and let $(G, S)$
be a $(R_0, d)$-growth group. Then there exists a set of generators $S'$ of $[G, G]$, each
of the form $g[e, e']g^{-1}$ for some $e, e' \in S$ and $g \in B_S(\frac{1}{2} R_0^\kappa - 4)$. In particular,
$([G, G], S')$ is a $(\infty, R_0^\kappa)$-subgroup of $(G, S)$, and thus (by Remark 1.6) $([G, G], S')$
is a $(R_0^{1-\kappa}, d/(1 - \kappa))$-growth group.

Proof. It is not difficult to see that $[G, G]$ is generated by the elements $g[e, e']g^{-1} =
[geg^{-1}, ge'g^{-1}]$ where $e, e' \in S$ and $g \in G$ (since modulo the normal group generated
by these elements, all the basis elements $e, e'$ commute). The difficulty is to replace
this infinite generating set by a finite one.

For any $r \ge 1$, let $A_r := \{g[e, e']g^{-1} : e, e' \in S, g \in B_S(r)\} \subset B_S(2r + 4)$,
and let $A_{\le r} := A_r \cdot A_{r-1} \cdot \ldots \cdot A_1$. Then $A_{\le r} \subset B_S(O(r + 1)^2)$, and thus by
hypothesis we have $|A_{\le r}| \le R_0^d$ for $r \le c R_0^{1/2}$ and some small absolute constant
$c > 0$. By hypothesis (and if $C$ is large enough), $R_0^{\kappa/2}$ is larger than a large multiple
of $d \log R_0$. In particular, one can find a $2 \le r_0 \le \frac{1}{2} R_0^\kappa - 4$ such that

$|A_{\le r_0}| \le 2^{r_0 - 2}$.

From the pigeonhole principle, we can thus find $2 \le r \le r_0$ such that

(3) $|A_{\le r+1}| < 2|A_{\le r}|$.

This implies that for any $x \in A_{r+1}$, the sets $x \cdot A_{\le r}$ and $A_{\le r}$ overlap, thus

(4) $A_{r+1} \subset A_{\le r} \cdot A_{\le r}^{-1}$.

In particular, $A_{r+1}$ is contained in the group generated by $A_r$, which implies that
the act of conjugation by any element $g \in S$ preserves the group generated by
$A_r$. Since $A_r$ also contains the commutators $[e, e']$ for $e, e' \in S$, we conclude that
$S' := A_r$ generates $[G, G]$; also, $S'$ is symmetric by construction. The claims of the
lemma then follow. □

Next, we show that finite index subgroups of finitely generated groups continue 
to be finitely generated in a very quantitative manner. 

Lemma 4.8 (Qualitative finite index implies quantitative finite index). Let $d \ge 1$,
$I \ge 1$. Let $G = (G, S)$ be a finitely generated group, and let $G'$ be a finite index
subgroup of $G$ with index $|G : G'| \le I$. Then there exists a set $S' \subset B_S(2I + 1)$ of
generators of $G'$ such that $(G', S')$ is a $(2I + 1, 2I + 1)$-subgroup of $G$.

Proof. For each $r \geq 0$, the set $B_S(r) \cdot G'$ is a union of left cosets of $G'$. The number of such cosets is of course at most $|G : G'|$. Thus by the pigeonhole principle, one can find $0 \leq r_0 \leq I$ such that $B_S(r_0+1) \cdot G' = B_S(r_0) \cdot G'$. In particular,

$B_S(r_0+1) \subset B_S(r_0) \cdot G'.$

If we set $S' := G' \cap B_S(2r_0+1)$, then $S'$ is symmetric and contained in $B_S(2I+1)$, and by the triangle inequality we have

$B_S(r_0+1) \subset B_S(r_0) \cdot S'.$

Multiplying both sides on the left by $B_S(r - r_0)$, we obtain

$B_S(r+1) \subset B_S(r) \cdot S'$

for all $r \geq r_0$ (and hence for all $r \geq 2I+1$). In particular

(21 + 2) c B S (2I + 1)-B S > {21 + 1) 

and the claim follows. □ 
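The pigeonhole step in this proof can be tried out on a small example. The following is only an illustrative sketch, not part of the paper: it assumes the toy case $G = \mathbb{Z}/n\mathbb{Z}$ with $G'$ the multiples of $m$, and the helper names `word_lengths`, `short_generators` and `generated_subgroup` are hypothetical. It finds the first radius $r_0$ at which the balls stop meeting new cosets, forms $S' = G' \cap B_S(2r_0+1)$, and lets one check that $S'$ generates $G'$.

```python
from collections import deque


def word_lengths(n, gens):
    """Breadth-first search: word length of each reachable element of Z/nZ."""
    dist = {0: 0}
    queue = deque([0])
    while queue:
        x = queue.popleft()
        for s in gens:
            y = (x + s) % n
            if y not in dist:
                dist[y] = dist[x] + 1
                queue.append(y)
    return dist


def short_generators(n, gens, m):
    """Toy case G = Z/nZ, G' = multiples of m (m must divide n).

    Returns (r0, S') with S' = G' intersected with B_S(2*r0 + 1).
    """
    dist = word_lengths(n, gens)

    def cosets_met(r):
        # Cosets of G' that meet B_S(r), labelled by residues mod m.
        return {x % m for x, d in dist.items() if d <= r}

    r0 = 0
    while cosets_met(r0 + 1) != cosets_met(r0):
        r0 += 1
    s_prime = sorted(x for x, d in dist.items() if x % m == 0 and d <= 2 * r0 + 1)
    return r0, s_prime


def generated_subgroup(n, gens):
    """All elements of Z/nZ that are sums of elements of gens."""
    return set(word_lengths(n, gens))
```

For instance, with $n = 60$, $S = \{+1, -1\}$ and $m = 5$ (index $I = 5$), the sketch stabilises at $r_0 = 2 \leq I$ and returns the short generating set $\{0, 5, 55\}$ of $G'$.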

5. Reduction to two key propositions 

Theorem 3.2 (modified as discussed at the end of Section 3) is deduced from one
easy proposition and two difficult ones. We begin with the easy proposition, which
handles the base case $d < 1$:

Lemma 5.1 (Sublinear growth case). Suppose that $(G,S)$ is a $(R,d)$-growth group for some $0 < d < 1$ and $R > 1$. Then $G = B_S(R)$ and so $G$ is finite with $|G| \leq R^d$.

Proof. The $|B_S(r)|$ for $0 \leq r \leq R$ are integers between $1$ and $|B_S(R)| \leq R^d < R$ that increase in $r$. Thus by the pigeonhole principle, there exists $0 \leq r \leq R-1$ such that $|B_S(r)| = |B_S(r+1)|$, thus $B_S(r) = B_S(r+1)$. Iterating this we see that $B_S(r) = B_S(r')$ for all $r' \geq r$, thus $G = B_S(r) = B_S(R)$ and the claim follows. □
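The stabilisation of ball sizes used in this proof is easy to observe numerically. The sketch below is illustrative only: it assumes the toy group $\mathbb{Z}/n\mathbb{Z}$ with generators $\pm 1$, and the function names `ball_sizes` and `first_stable_radius` are hypothetical. It lists $|B_S(r)|$ and reports the first radius at which two consecutive balls coincide.

```python
from collections import deque


def ball_sizes(n, gens, max_radius):
    """Return [|B_S(0)|, ..., |B_S(max_radius)|] for Z/nZ with generators gens."""
    dist = {0: 0}
    queue = deque([0])
    while queue:
        x = queue.popleft()
        for s in gens:
            y = (x + s) % n
            if y not in dist:
                dist[y] = dist[x] + 1
                queue.append(y)
    return [sum(1 for d in dist.values() if d <= r) for r in range(max_radius + 1)]


def first_stable_radius(sizes):
    """Smallest r with |B_S(r)| = |B_S(r+1)|, or None if there is none in range."""
    for r in range(len(sizes) - 1):
        if sizes[r] == sizes[r + 1]:
            return r
    return None
```

With $n = 7$ and $R = 8$ one has $|B_S(8)| = 7 = 8^d$ for $d = \log 7/\log 8 < 1$, and the balls already stabilise at radius $3 \leq R - 1$, as the pigeonhole argument predicts.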

The higher order growth cases d > 1 will then be handled by an induction on d. 
The first major step in this process is to obtain a reduction in the order of growth 
for a certain commutator subgroup: 

Proposition 5.2 (Reduction in growth order). Let Ro,d > 1, and let (G,S) be a 
(Ro, d)-growth group. Assume that 

(5) Ro > exp(exp(Cd c )) 

for some sufficiently large absolute constant C. Then at least one of the following 
holds: 

(i) G = S s (i?o xp(oxp(0(<i)0<1))) ) (in particular, G is finite). 
(ii) There exists a (R^ 10 , Rq' ' 1C ')- subgroup (G',S') of G and a positive integer
I = 0(d°^) such that (G") (i) is generated by a set (S') (0 C B s ,(Rl /W ) for 
which ((G')( l \ (S")0) is a (R, d — 0.9)-growth group for some Rq 1 / 10d < 

R < Rq. 

This proposition will be established in Sections IMTuI using (quantitative versions 
of) some arguments of Kleiner [8]. The bounds here are completely effective (and
far superior to those in Proposition 5.3 below).

To exploit this growth reduction we need the following variant of Theorem 1.8:

Proposition 5.3 (Semi-quantitative Milnor-Wolf theorem). Let s,l,r, Ro,d > 1, 
and suppose that 

Ro > r c (Cd) cdl 

for some sufficiently large absolute constant C. Suppose we have a short exact 
sequence 

$0 \to H \to G \to L \to 0$
of groups, where G — (G, S) is a (Ro, d)-growth group, H = (H, Sh) is a virtually 
r-nilpotent (see Def. \3.3\) (oo, Rq /11 ')- subgroup of G, and L is solvable of derived 
length at most I. Then (G, S) is virtually r + lO(l) d -nilpotent. 

We prove this proposition in Sections [TTIITil The arguments are quite different 
from those used to prove Proposition 5.2, being related to the arguments used by
Milnor [20] and Wolf [32]. A key new technical difficulty in this "single-scale" 
setting, not present in earlier "multi-scale" work, is that the index of the nilpotent 
subgroup of H may be so large that this subgroup is not "visible" at the one scale 
Ro that we directly control. 

In the remainder of this section we show how the above three propositions imply Theorem 3.2 (modified as discussed at the end of Section 3). From Lemma 5.1 we see that Theorem 3.2 holds for $d < 1$. We may thus assume inductively that $d \geq 1$ and that the claim has already been proven for $d - 0.9$. By Proposition 5.2 we see that either conclusion (i) or conclusion (ii) of that proposition holds. If conclusion (i) holds, then $G$ is finite and the claim follows. If instead conclusion (ii) holds, then by induction hypothesis we see that $((G')^{(l)}, (S')^{(l)})$ is virtually $C^{d-0.9}$-nilpotent. On the other hand, $G'/(G')^{(l)}$ is clearly solvable of derived length at most $l = O(d^{O(1)})$. Applying Proposition 5.3 to the short exact sequence

$0 \to (G')^{(l)} \to G' \to G'/(G')^{(l)} \to 0$

we conclude (if $C$ is large enough) that $G'$ is virtually $C^d$-nilpotent. Since $G'$ is a finite index subgroup of $G$, the claim then follows.

6. First step for Proposition 5.2: produce a non-trivial Lipschitz
almost harmonic function

We now begin the proof of Proposition 5.2, following Kleiner [8]. Kleiner's argument is based on harmonic functions, and in particular on studying the class of functions $u : G \to \mathbb{R}$ which are both Lipschitz and harmonic with respect to the set $S$ of generators. In our quantitative applications, the group $G$ could well be finite (with extremely large cardinality), and so there may not be any harmonic functions other than the constants. To deal with this we need to consider instead a somewhat larger class of almost harmonic functions. A good example is provided on the finite group $G = \mathbb{Z}/N\mathbb{Z}$ (with the standard generators $+1, -1$, and with $N$ large) by the function $u(x) := N \sin(2\pi x/N)$. This function is Lipschitz (with Lipschitz norm approximately $2\pi$), and "almost harmonic" in the sense that $u(x+1) + u(x-1) - 2u(x) = O(1/N)$ for all $x$.
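The two claims about this example can be checked numerically. The following is a small sketch (the function names are hypothetical) that evaluates the largest first and second differences of $u(x) = N\sin(2\pi x/N)$ over $\mathbb{Z}/N\mathbb{Z}$. Elementary trigonometry gives the bounds $2N\sin(\pi/N) \leq 2\pi$ and $4N\sin^2(\pi/N) \leq 4\pi^2/N$, which the sketch lets one confirm.

```python
import math


def u(x, n):
    """The model function u(x) = N sin(2 pi x / N) on Z/NZ."""
    return n * math.sin(2 * math.pi * x / n)


def max_first_difference(n):
    """sup over x of |u(x+1) - u(x)|."""
    return max(abs(u(x + 1, n) - u(x, n)) for x in range(n))


def max_second_difference(n):
    """sup over x of |u(x+1) + u(x-1) - 2u(x)|."""
    return max(abs(u(x + 1, n) + u(x - 1, n) - 2 * u(x, n)) for x in range(n))
```

Increasing $N$ by a factor of ten leaves the first difference essentially unchanged while the second difference shrinks by roughly the same factor, which is the $O(1/N)$ behaviour described above.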

It is known (see [H Appendix] and the references therein) that infinite groups 
admit non-trivial Lipschitz harmonic functions. It is thus not surprising that all 
"large" groups (finite or infinite) admit non-trivial almost harmonic Lipschitz func- 
tions. To see this, let us first formalize our definitions. 

Definition 6.1. Let $u : G \to \mathbb{R}$ be a function. The gradient $\nabla u : G \to \mathbb{R}^S$ of $u$ is defined by the formula

$\nabla u(x) := (u(xs) - u(x))_{s \in S}$

so in particular

$|\nabla u(x)| := \left( \sum_{s \in S} |u(xs) - u(x)|^2 \right)^{1/2}.$

Dually, given a vector-valued function $F = (F_s)_{s \in S} : G \to \mathbb{R}^S$, we can define its divergence $\nabla \cdot F : G \to \mathbb{R}$ by the formula

$\nabla \cdot (F_s)_{s \in S}(x) = \sum_{s \in S} F_s(xs^{-1}) - F_s(x).$

The Laplacian of a function $u : G \to \mathbb{R}$ is defined by the formula

$\Delta u := -\nabla \cdot \nabla u$

or more explicitly (using the symmetry of $S$)

$\Delta u(x) = 2|S| u(x) - 2 \sum_{s \in S} u(xs).$

The Lipschitz (semi)-norm $\|u\|_{\mathrm{Lip}}$ of $u$ is defined as

$\|u\|_{\mathrm{Lip}} := \|\nabla u\|_{\ell^\infty(G)} = \sup_{x \in G} |\nabla u(x)|.$

A function $u : G \to \mathbb{R}$ is harmonic if $\Delta u = 0$. If $\varepsilon > 0$, we say that a function $u : G \to \mathbb{R}$ is $\varepsilon$-harmonic Lipschitz if $\|u\|_{\mathrm{Lip}} \leq 1$ and $\|\Delta u\|_{\ell^\infty(G)} \leq \varepsilon$.
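To make these definitions concrete, here is a minimal sketch for a finite group given by an explicit element list and a multiplication function. All names (`gradient`, `laplacian`, `lipschitz_norm`, `is_eps_harmonic_lipschitz`) are hypothetical, and the Laplacian follows the explicit formula $2|S|u(x) - 2\sum_{s} u(xs)$; since only the size of $\Delta u$ enters the definition of $\varepsilon$-harmonicity, the overall sign convention is immaterial here. The example function $\frac{N}{10}\sin(2\pi x/N)$ uses an assumed normalising factor $1/10$, chosen only so that the Lipschitz norm is at most $1$.

```python
import math


def gradient(u, elements, gens, mul):
    """Return {x: [u(xs) - u(x) for s in gens]}."""
    return {x: [u[mul(x, s)] - u[x] for s in gens] for x in elements}


def laplacian(u, elements, gens, mul):
    """Explicit formula: 2|S| u(x) - 2 * sum over s of u(xs)."""
    size = len(gens)
    return {x: 2 * size * u[x] - 2 * sum(u[mul(x, s)] for s in gens) for x in elements}


def lipschitz_norm(u, elements, gens, mul):
    """sup over x of the Euclidean norm of the gradient at x."""
    grad = gradient(u, elements, gens, mul)
    return max(math.sqrt(sum(c * c for c in grad[x])) for x in elements)


def is_eps_harmonic_lipschitz(u, elements, gens, mul, eps):
    """Check ||u||_Lip <= 1 and sup |Laplacian u| <= eps."""
    lap = laplacian(u, elements, gens, mul)
    return (lipschitz_norm(u, elements, gens, mul) <= 1
            and max(abs(v) for v in lap.values()) <= eps)


# Toy example: Z/NZ with generators +1 and -1 (written as N - 1).
N = 200
ELEMENTS = list(range(N))
GENS = (1, N - 1)


def add_mod(x, s):
    return (x + s) % N


SINE = {x: (N / 10) * math.sin(2 * math.pi * x / N) for x in ELEMENTS}
```

In this toy case the sine function is $8/N$-harmonic Lipschitz but not $1/N^2$-harmonic Lipschitz, consistent with the $O(1/N)$ behaviour discussed in the text.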

Remark 6.2. The conventions for the Laplacian A and Lipschitz norm may differ 
by some (ultimately irrelevant) constant factors from other definitions in the liter- 
ature. With these conventions, the function ^JVsin(2ra/Af) is 0(l/iV)-harmonic 
Lipschitz on Z/NZ. 

Proposition 6.3 (Infinite groups have non-constant Lipschitz harmonic functions). 

Let R > 1, and let (G,S) be a finitely generated group. Then at least one of the 
following statements is true: 

• $G = B_S(R)$.

• There exists a $O(|S| R^{-1/3})$-harmonic Lipschitz function $u : G \to \mathbb{R}$ with $|\nabla u(\mathrm{id})| \geq 1/|S|$.

Remark 6.4. Comparing this with the example u(x) = ^Nsm(2Trx/N) on Z/NZ 
we see that the exponent —1/3 in the Proposition should probably be a — 1. How- 
ever, any positive exponent will suffice for our application. 
Proof. Given two functions $u, v : G \to \mathbb{R}$, we formally define the convolution $u * v : G \to \mathbb{R}$ by the formula

$u * v(x) := \sum_{y \in G} u(y) v(y^{-1}x) = \sum_{y \in G} u(xy^{-1}) v(y).$

By Young's inequality, convolution is well-defined whenever $u \in \ell^p(G)$, $v \in \ell^q(G)$ and $1/p + 1/q = 1/r + 1$ for some $1 \leq p, q, r \leq \infty$, in which case we have

(6) $\|u * v\|_{\ell^r(G)} \leq \|u\|_{\ell^p(G)} \|v\|_{\ell^q(G)}.$

We observe the pleasant identities

(7) $\nabla(f_1 * f_2) = f_1 * \nabla f_2$

(8) $\nabla \cdot (f * F) = f * (\nabla \cdot F)$

and thus

(9) $\Delta(f_1 * f_2) = f_1 * \Delta f_2.$
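Identity (9) holds because the convolution acts by left translation on the second factor while the Laplacian only involves right multiplication by generators. As an illustrative check (not from the paper), the sketch below verifies it exactly on the non-abelian group $S_3$, with permutations stored as tuples and the two adjacent transpositions as an assumed symmetric generating set; the names `mul`, `inv`, `convolve`, `convolve_alt` and `laplacian` are hypothetical.

```python
from itertools import permutations

# The symmetric group S_3; a permutation p is the tuple (p(0), p(1), p(2)).
ELEMENTS = list(permutations(range(3)))
# Two transpositions: a symmetric generating set, since each is its own inverse.
GENS = [(1, 0, 2), (0, 2, 1)]


def mul(p, q):
    """Composition: (p * q)(i) = p(q(i))."""
    return tuple(p[q[i]] for i in range(3))


def inv(p):
    """Inverse permutation."""
    out = [0] * 3
    for i, image in enumerate(p):
        out[image] = i
    return tuple(out)


def convolve(u, v):
    """(u * v)(x) = sum over y of u(y) v(y^{-1} x)."""
    return {x: sum(u[y] * v[mul(inv(y), x)] for y in ELEMENTS) for x in ELEMENTS}


def convolve_alt(u, v):
    """Second form: (u * v)(x) = sum over y of u(x y^{-1}) v(y)."""
    return {x: sum(u[mul(x, inv(y))] * v[y] for y in ELEMENTS) for x in ELEMENTS}


def laplacian(u):
    """Explicit formula: 2|S| u(x) - 2 * sum over s of u(xs)."""
    return {x: 2 * len(GENS) * u[x] - 2 * sum(u[mul(x, s)] for s in GENS)
            for x in ELEMENTS}


# Arbitrary integer-valued test functions, so all comparisons are exact.
F1 = {x: i * i + 1 for i, x in enumerate(ELEMENTS)}
F2 = {x: 3 * i - 4 for i, x in enumerate(ELEMENTS)}
```

Because the test functions are integer-valued, both sides of (9) can be compared with exact equality rather than up to a floating-point tolerance.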

Fix $R$. We may assume that $G \neq B_S(R)$, since we are done otherwise. Let $\sigma$ be the measure $\sigma := \frac{1}{|S|} \sum_{s \in S} \delta_s$, where $\delta$ is the Kronecker delta. Observe that $\Delta u = 2|S|(u * \sigma - u)$ for any $u$, where $u * \sigma(x) := \sum_{y \in G} u(xy^{-1}) \sigma(y)$ is the usual convolution operator.

Let $\sigma^{(m)} := \sigma * \ldots * \sigma$ be the $m$-fold convolution of $\sigma$, and let $f : G \to \mathbb{R}$ be the
function

1 R 
J R ^ 

m— 

Then we have $\|f\|_{\ell^1(G)} = 1$ and $\|\Delta f\|_{\ell^1(G)} \ll |S|/R$. We divide into two cases.

Case 1 ("non-amenable" case): ||V/||fi((j) > R~ 2 ^ 3 . By the pigeonhole 
principle, we can then find s € S such that the function f s (x) :— f(xs) — f(x) 
has i 1 norm at least |S , | _1 ||V/||^i(G). If we then define u :— h* /, where h(x) := 
sgn(/ s (a;- 1 ))/||V/||^(G), we see from ©, that 

l|Vtt|| € - (G) < 1 

and 

\u(s) - u(id)| > 1/|S| 

while from ©, © we have 

||Au||<- (G) « (|^|/i?)/i?- 2/3 

and the claim follows. 

Case 2. ("amenable" case): ||V/||;i( G ) < R~ 2/3 - If we set F := l/l 1 / 2 then 
we have the pointwise bound 

|VF| < |S| 1/2 |V/| 1/2 

thanks to the elementary estimate Ix] 1 ^ 2 — ly] 1 ^ 2 < \x — y] 1 ^ 2 for x,y > 0. Thus we 
have 

||F|| £2(G) = 1 and ||VF|| P(G) < l^iT 1 / 3 . 

Also, F is supported on Bs(R). Since G 7^ Bs(R) by hypothesis, F cannot be 
constant. In particular 

\\VF\\ eKG) > 0. 
Thus if $\mu_F$ is the spectral measure of $\Delta$ relative to $F$, we see that $\mu_F$ is a probability
measure on $[0,1]$ with

< f x dp F {x) = \\VFf iHG) < \S\R- 1/3 . 
Jo 

This implies that 

<.|S|fl~ 1/3 r\S\R- 1/3 

0< x 2 dp F (x) < \S\R- 1/3 / x dp F (x). 



If we then let F' € £ 2 {G) be the spectral projection of F to [0, |5|i?" 1/3 ], we 
conclude that 

(10) < ||AF'||| 2(G) < \S\R- l I^F'\\% {G) . 

By the pigeonhole principle, we can find s € S such that the function F' s (x) := 
F'(xs) — F'(x) obeys the lower bound 

(11) \\K\\% iG ) > isr^n^o- 

If we define u := H * F, where H{x) := F'^x- 1 ) F\\% {G) , we see from ©, © 
that 

||Vm||£oo( G ) < 1 

while from (|11[) we have 

\u{s) - u(id)| > l/\S\ 
and from ©, ©, {TDJ) we have 

||Au||^ (G) < \S\R- 1 '* 

and the claim follows. □ 

Remark. See also the recent [12] for a related probabilistic construction of (Hilbert 
space valued) harmonic maps on amenable groups. 

Once one has one non-trivial almost harmonic function $u$, one can then create a further family $\rho(g)u$, $g \in G$ of almost harmonic functions by translation: $\rho(g)u(x) := u(g^{-1}x)$. Kleiner's approach to Gromov's theorem revolves around a study of this family, starting with the fundamental observation that this family is (approximately) finite-dimensional. We now turn to this important fact.

7. Second step for Proposition 5.2: Kleiner's theorem

Kleiner [5], building upon work of Colding and Minicozzi [T], proved that in a group of polynomial growth the space of Lipschitz harmonic functions is finite-dimensional; indeed, an inspection of the argument shows that the dimension of this space is bounded by some constant $C(d)$ depending only on the order $d$ of growth. The main objective of this section is to prove the following quantitative version of this fact:

Theorem 7.1 (Quantitative Kleiner theorem). There exists an absolute constant
$C > 0$ such that for any $0 < \kappa < 0.1$, and any $(R_0, d)$-growth group $(G,S)$ with
$d \geq 1$ and



(12) Ro > K := (C\S\) 
there exists a finite-dimensional subspace $V$ of $\mathbb{R}^G$ of dimension

dim{V) = 0{\S\) 0{d3/K2) 

such that for every Rq K -harmonic Lipschitz function u : G -A R, there exists v £ V 
such that 

\\u-v\\ e2{Bs(Ri -, )} <R-^ d . 

We will prove Theorem 7.1 in several stages, following [8]. For any $x \in G$ and
$r \geq 1$, we write $B(x,r) := x \cdot B_S(r)$ for the ball of radius $r$ centered at $x$. The first
step is the following Poincare inequality (compare with one first proved by Colding 
and Minicozzi in [I] , and its adaptation to the group setting due to Kleiner-Saloff- 
Coste in Theorem 2.2]): 

Lemma 7.2 (Poincaré inequality). Let $f : G \to \mathbb{R}$, $x \in G$, and $r \geq 1$. Let $f_{B(x,r)} := \frac{1}{|B(x,r)|} \sum_{B(x,r)} f$ be the average value of $f$ on $B(x,r)$. Then

$\|f - f_{B(x,r)}\|_{\ell^2(B(x,r))} \leq 2r \frac{|B_S(2r)|}{|B_S(r)|} \|\nabla f\|_{\ell^2(B(x,3r))}.$

Proof. By definition of the gradient, we have the pointwise bound

$|f(yg) - f(ygs)| \leq |\nabla f(yg)|$

for all $y, g \in G$ and $s \in S$. If we take $g \in B_S(2r)$ and average this in $\ell^2$ over all $y \in B(x,r)$, we conclude that

$\left( \sum_{y \in B(x,r)} |f(yg) - f(ygs)|^2 \right)^{1/2} \leq \|\nabla f\|_{\ell^2(B(x,3r))}.$

Telescoping this using the triangle inequality, we conclude that

$\left( \sum_{y \in B(x,r)} |f(yg) - f(y)|^2 \right)^{1/2} \leq 2r \|\nabla f\|_{\ell^2(B(x,3r))}$

for all $g \in B_S(2r)$. Summing in $g$ using the triangle inequality, we conclude that

$\left( \sum_{y \in B(x,r)} \left( \sum_{g \in B_S(2r)} |f(yg) - f(y)| \right)^2 \right)^{1/2} \leq 2r |B_S(2r)| \|\nabla f\|_{\ell^2(B(x,3r))}.$

But for any $y \in B(x,r)$, we have

$|f(y) - f_{B(x,r)}| \leq \frac{1}{|B_S(r)|} \sum_{z \in B(x,r)} |f(z) - f(y)| \leq \frac{1}{|B_S(r)|} \sum_{g \in B_S(2r)} |f(yg) - f(y)|$

and the claim follows. □
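The inequality can be sanity-checked numerically. The sketch below is illustrative only and rests on stated assumptions: it works in the toy group $\mathbb{Z}/n\mathbb{Z}$ with generators $\pm 1$, takes $n > 6r + 1$ so that the balls involved do not wrap around, and uses the constant $2r\,|B_S(2r)|/|B_S(r)|$ obtained by chaining the displayed steps of the proof. The function names are hypothetical.

```python
import math
import random


def ball(center, radius, n):
    """The ball B(center, radius) in Z/nZ with generators +1, -1 (needs 2*radius < n)."""
    return [(center + g) % n for g in range(-radius, radius + 1)]


def poincare_sides(f, x, r, n):
    """Return (lhs, rhs) of the Poincare inequality for f on B(x, r)."""
    b_r = ball(x, r, n)
    mean = sum(f[y] for y in b_r) / len(b_r)
    lhs = math.sqrt(sum((f[y] - mean) ** 2 for y in b_r))
    # Squared l^2 norm of the gradient over B(x, 3r), summing over both generators.
    grad_sq = sum((f[(y + s) % n] - f[y]) ** 2
                  for y in ball(x, 3 * r, n) for s in (1, -1))
    ratio = len(ball(0, 2 * r, n)) / len(ball(0, r, n))
    rhs = 2 * r * ratio * math.sqrt(grad_sq)
    return lhs, rhs


def random_function(n, seed):
    """A reproducible random real-valued function on Z/nZ."""
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]
```

For random data the left-hand side is typically far smaller than the right-hand side, which reflects the fact that the constant in the lemma is not optimised.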

For $\varepsilon$-harmonic functions, we have a reverse inequality:

Lemma 7.3 (Reverse Poincaré inequality). Let $f : G \to \mathbb{R}$ be $\varepsilon$-harmonic Lipschitz, and let $B(x,r)$ be a ball for some $r \geq 1$. Then

$\|\nabla f\|_{\ell^2(B(x,r))} \ll |S|^{O(1)} \left( \frac{1}{r} \|f\|_{\ell^2(B(x,2r))} + \varepsilon r |B_S(2r)|^{1/2} \right).$

Proof. Let $\psi : G \to \mathbb{R}$ be the cutoff function $\psi(y) := \max(1 - \frac{\mathrm{dist}(x,y)}{2r}, 0)$. Then it will suffice to show that

(13) $\sum_{y \in G} |\nabla f|^2 \psi^2(y) \ll |S|^{O(1)} \left( \frac{1}{r^2} \|f\|_{\ell^2(B(x,2r))}^2 + \varepsilon^2 r^2 |B_S(2r)| \right).$




We may clearly restrict the sum on the left to B(x,2r — 1). Now, for any y £ 

B(x, 2r — 1) and s £ S we have 

(14) 

(f(ys)-f(y))i> 2 (y) = (/V' 2 (ys)-/^ 2 (2/))-/(y^(y)(^M-V'(y))-/M(^M-^(y)) 2 . 

From the triangle inequality, dist(x, ys) differs from dist(a;, y) by at most 1, and 
thus 

i>{ys)-i>{y) = 0{l/r). 
Also, f{ys) = 0(\f(y)\ + |V/(t/)|). Multiplying (O by f{ys) - f(y) and summing 
in s, we conclude that 

|V/| = V(/^ 2 ) • V/ + 0(|5|(|/(y)| + |V/(y)|)(^M + 1)). 

Inserting this into the left-hand side of (| 13[) and summing by parts, we conclude 
that 

E IWlV 2 (2/) « E l/V> 2 G/)l|A/(i/)| + ^ E IW(y)l(l/(z/)l 

2/6G yGG y&G 

(15) . . 

+ |V/(y)|)^(y) + y E l v /(2/)l(l/(2/)l + |V/(y)|). 

yeB(a,2r-l) 

As / is e-harmonic, we can use Cauchy-Schwarz to bound 

J2\N 2 (y)m(y)\<e E 

y£G y£B s (2r-l) 

<e|i?5(2r)| 1 / 2 ||/||, 2(Bs(:Ci2j ,_ 1)) 
1 

^2 II J IU2(B s (x,2r-l))- 

Another application of Cauchy-Schwarz gives 

l -y E |V/(y)|(|/(y)|+|V/(y)|)^(y) < | V /(y)|V(tf)+^ (!/(2/)l+|V/(y)|) 2 

yGG ySG yGG 

for any c > 0. By choosing c small enough, we can then absorb the first term on 
the right-hand side into the left-hand side of (|15l) and conclude that 

E |V/|V(l/) « e*r*\B s {2r)\ + l^qf^^^ + || V/H^^). 
yea 

But from definition of V and the triangle inequality we see that 

\\^f\\p(B s (x,2r-l)) < |-9|||/||^(B s (x,2r)) 

and the claim follows. □ 

Now we need some more definitions. Given $R > 0$, define the symmetric bilinear form $Q_R : \mathbb{R}^G \times \mathbb{R}^G \to \mathbb{R}$ by the formula

(16) $Q_R(u,v) := \sum_{x \in B_S(R)} (u(x) - u(\mathrm{id}))(v(x) - v(\mathrm{id})).$

This form is clearly positive semi-definite (with finite rank). Given any $u_1, \ldots, u_k : G \to \mathbb{R}$, we define the $R$-volume $\mathrm{Vol}_R(u_1, \ldots, u_k)$ by the formula

$\mathrm{Vol}_R(u_1, \ldots, u_k) = \det((Q_R(u_i, u_j))_{1 \leq i,j \leq k})^{1/2};$



< £ V|i? s (2r)| + -11/11 
the right-hand side is non-negative due to the positive semi-definite nature of $Q_R$. Geometrically, one can view $\mathrm{Vol}_R(u_1, \ldots, u_k)$ as the length of the wedge product $u_1 \wedge \ldots \wedge u_k$ in $\bigwedge^k \mathbb{R}^G$, using the induced semi-definite form from $Q_R$.

The $Q_R$ are monotone increasing in $R$ as symmetric bilinear forms, which implies the monotonicity relationship

(17) $\mathrm{Vol}_R(u_1, \ldots, u_k) \leq \mathrm{Vol}_{4R}(u_1, \ldots, u_k).$
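The $R$-volume is simply the square root of a Gram determinant, which makes it easy to experiment with. The following sketch is illustrative only: it assumes the group $\mathbb{Z}$ with generators $\pm 1$ (so that $B_S(R) = \{-R, \ldots, R\}$ and the identity is $0$), represents functions as Python callables on the integers, and uses exact rational arithmetic for the determinant. The names `q_form`, `gram`, `det` and `volume` are hypothetical.

```python
import math
from fractions import Fraction


def q_form(u, v, radius):
    """Q_R(u, v): sum over x in B_S(R) of (u(x) - u(0)) * (v(x) - v(0)) on Z."""
    return sum((u(x) - u(0)) * (v(x) - v(0)) for x in range(-radius, radius + 1))


def gram(funcs, radius):
    """Gram matrix (Q_R(u_i, u_j)) of the given functions."""
    return [[q_form(u, v, radius) for v in funcs] for u in funcs]


def det(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [[Fraction(entry) for entry in row] for row in matrix]
    size = len(m)
    result = Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result
        result *= m[col][col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for c in range(col, size):
                m[r][c] -= factor * m[col][c]
    return result


def volume(funcs, radius):
    """Vol_R(u_1, ..., u_k) = det(Gram matrix)^(1/2)."""
    return math.sqrt(det(gram(funcs, radius)))
```

For $u_1(x) = x$ and $u_2(x) = x^2$ at $R = 2$ the Gram matrix is diagonal with entries $10$ and $34$, so the squared volume is $340$; passing from $R$ to $4R$ only adds non-negative terms to the form, which is the monotonicity (17).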

When $k$ is large, and the $u_1, \ldots, u_k$ are approximately harmonic, we can improve
this inequality by applying Lemma 7.3 on large balls and Lemma 7.2 on
small balls. More precisely, we have the following inequality (cf. [8j Lemma 3.16], 
[U Proposition 4.16]): 

Proposition 7.4 (Volume decrease). Let k > 1 be an integer, < e,6 < 1 and 
R > 1/(5. Let Mi, . . . , Uk ■ G — > R be s-harmonic Lipschitz functions. Suppose also 
that 

Then we have 

Vol R (ux, . . . , u k ) < 0(\S\)°™ (V 2 ^Bs(SR)\ ) ^ YoUr ^ ■■■^) + ke 2 R k+ "\B s (4R)\ k ^ 
Proof. Let us write 

(19) v := Vol 4fl (Mi,...,M fe ). 
We may of course assume that 

(20) v > ke 2 R k+2 \B s (4R)\ k / 2 , 

otherwise we are done by (j 1 T[) . In particular, this implies that Hi, . . . , u k are linearly 
independent. Let V be the fc-dimensional subspace of R G spanned by the u\, . . . , Uk', 
the non-vanishing of v implies that V is a Hilbert space with respect to the bilinear 
form Q 4R . 

We will now study a subspace V' of V consisting of "locally mean zero" func- 
tions. Let x\, . . . , x m be a maximal 2<5i?-separated subset of Bs(R). Then the balls 
B(xj,6R) for 1 < j < m are disjoint and contained in B(2R), so we have the upper 
bound 

\B S (2R)\ 
~ \Bs(SR)\- 

We then introduce the subspace 

V' :={ueV : u B[X]t2SR) = for all 1 < j < m; m(0) = 0} 

where we recall that ub ■— pjj YlyeB u (v) i s ^he mean of u on B. Clearly V' is a 
subspace of V of codimension at most m + 1; by (fTSt . we conclude that 

(21) dim(F') > k/2. 

Using the Gram-Schmidt process, we may then find an orthonormal (with respect 
t° Qr) basis u\, . . . , Uk of V such that u\, . . . , uai m (y) lies in V' . 

Let 1 < i < dim (V). From Lemma [?T2l and the construction of V we have 

(22) Y, l^(y)l 2 «^ 2 ^ 2 E l v ^(y)l 2 )- 

yeB(xj,28R) yeB(xj,6SR) 
Now suppose a point $x$ is contained in $J$ balls $B(x_j, 6\delta R)$. Then the $J$ balls $B(x_j, \delta R)$ are contained in $B(x, 7\delta R)$. Since these balls are disjoint, we conclude that $J \leq |B_S(7\delta R)|/|B_S(\delta R)|$. On the other hand, all the $B(x_j, 6\delta R)$ are contained in $B_S(2R)$. We can therefore sum (22) in $j$ to conclude that



j=l y£B(xj,25R) 



JyeBs(2R) 



On the other hand, by construction of the x_,- we see that the balls B(xj, 26 R) cover 
Bs(R). Since Ui(0) = 0, we thus see from (|16l) that 

(23) Q^X^ S E l^^)! 2 - 

1 Sl jl y&B s {2R) 

To estimate this, we wish to use Lemma 17.31 but first we must express u, as an 
approximately harmonic function. As the Uj have Lipschitz norm at most 1, we see 
from (TO)) that 

(24) QR(uj,Uj) <R 2 \B S (R)\ 

for all 1 < j < fc. Thus when represented in the orthonormal basis ui, . ■ ■ , Ufc, the 
ui, . . . , u& are elements of R fc of norm at most R\Bs{R)\ 1 ^ 2 ^ whose coordinates form 
a k x k matrix whose determinant of magnitude v. Taking adjugates (i.e. using 
Cramer's rule), we conclude that the inverse of this matrix has coefficients of size 
0{v- 1 {R\B s {R)\ 1 l 2 ) k ~ 1 ). In other words, the U\, . . . ,Uk are linear combinations of 
the Mi, . . . , Uk whose coefficients have size 0(v- 1 {R\B s (R)\ 1/2 ) k - 1 ). In particular, 
each Ui is equal to 0{kv~ 1 {R\Bg{R)\ 1 / 2 ) k ^ 1 ) times a e-harmonic Lipschitz function 
(note that the property of being e-harmonic Lipschitz is convex). We may then 
apply Lemma 17.31 to conclude that 

J2 |V^)| 2 «|S|°«(-^ My^ + ks'v-'R^BsiiRt/ 2 ). 

yGB s (2R) ydB s {iR) 

Since Ui(0) = 0, and Ui is a unit vector with respect to Q4R, we see from (I16p that 

E l^(y)l 2 = 1 - 

y£B s (4,R) 

Inserting these estimates and (|2U1) into (|2U)) we obtain 

Qr{Ui) K< 151 5 \B S (SR)\ 
for 1 < i < dim(T^'). Meanwhile, for dim(V') < i < m we have the crude bound 

Qr{u%) < QAR.{ui) = 1. 

We conclude (using (|2"TjO that 

Vol R (u 1 ,...,u k )<<0(\S\f^6^(\§^)^. 

\B S {6R)\ 

On the other hand, by orthonormality we have VoI^r^i, . . . , u^). Observe that the 
ratio Volfl(ui, . . . , Ufc)/ Vol4fl(iti, . . . , itfc) is invariant under row operations, and 
therefore by HH) we have 

Volfl( Ul) . . . , «*) « 0(|g|) o(fe) ^( l ^ s( J/^ l ) fe VoUhK, • • ■ ,«*) 

|S s ((5i?)| 
and the claim follows. □

Now we iterate Proposition 7.4 to obtain

Proposition 7.5 (Volume bound). Let k > 1 be an integer, 0<e<l, 0<k< 0.1, 
d >> 1, and Ro > 1. Assume that 

k > C d/K 

and 

Ro > C 1/K k c 

for some sufficiently large constant $C$. Then for every $(R_0, d)$-growth group $(G, S)$,
and all $\varepsilon$-harmonic Lipschitz functions $u_1, \ldots, u_k : G \to \mathbb{R}$, we have

V0\ RI - K (U U ...,U k )« R y d+1 ° S WV-c&kUvk + £ 2 0{Ro) 0( kd ) 

for some absolute constant c > 0. 

Proof. We choose $\delta$ by solving the equation

(25) k = C« los i 
for some large absolute constant C, thus 

(26) J = fc-distc. 

From our assumptions on k, Rq we see (for sufficiently large choices of constants) 
that 

(27) i?(7 al < S < 0.1. 

Let n be the largest integer such that 4 N Rl~ K < Ro, thus 

(28) 7V>Klogi? 

(note from hypothesis that Klogi? > 1). Set R n := 4™i?g~ K for 1 < n < N. By 
telescoping series and (P) we see that 

f[ \ B s( 2R n)\ tfOCdlogi) 0(1) ^logi 

11 155(^)1 K ° U[L) 
We thus conclude that for at least N/2 values of 1 < n < N, we have 



\B s (2R n )\ 



< O(l)^ log i 



\B s {SR n )\ 

By (j2"5)h we thus have (for C large enough) that 

for at least N/2 values of n. For each such n, we may apply Proposition 17.41 and 
conclude that 

Vo1h> 15 ...,u k )< 0(\S\)°^S k /\ ^^- ) k / 2 Vol Rn+1 (m, u k )+ks*R^\B s {m n )\ k l\ 

\B s {oR n )\ 

We may make the assumption 

(29) Yo\ R i-,{ Ul , ...,u k )> 2ke 2 R K +2+dk/2 
since we are done otherwise. Then (by ([T]) and monotonicity of volume) we have
Volfljui, ...,u k )> 2ke 2 R l ;+ 2 \B s (2R n )\^ 2 

and thus 

Vol R J Ul , ...,«„)< 0(jg|)°W^ ( lM^ )*/ 2 Vol Rn+1 ( Ul , ...,u k ) 

for at least N/2 values of n. For the other values of n, we see from (|17[) that 

Vol_R„(ui,...,u fc ) < Vol Rn+1 (ui,...,u fc ). 
Putting this all together and using monotonicity of volume again, we conclude that 

Vol K ^K ...,„*)< 0(lf N \SrS™/Hfl ^f ( l* n h k/2 Vo1h (ui, . . • , u k ). 

n=o \ B s{on n )\ 

On the other hand, by telescoping series and (Q} we have 

^ \B s (7SR n )\ Q(d) 

Also, arguing as in the proof (|24[) we have 

Qjio («»«<) <Rl\B s {R)\ << +2 

and thus 

Vol flo (ui, . . . , u k ) < R° (kd) . 

We conclude that 

Vol Jl i-»(« 1 ,...,« fc ) < 0(\S\) kN S km Bg lM >. 
Substituting in (f26|) . (|28|) we obtain the claim. □ 

In practice, we will only use this proposition in the regime where $d, 1/\kappa, |S|$ are bounded, $k$ is sufficiently large depending on these parameters (but independent of $R_0$), and $\varepsilon$ is less than an extremely large negative power of $R_0$. More precisely, we will use the following corollary of Proposition 7.5:

Corollary 7.6 (Volume bound). Let < re < 0.1, d > 0, R a > 1, and Zet (G,S) 
be a (Roi d)- growth group. Suppose that 

k > (c\s\f d3/K2 

and 

Ro > k c 

for some sufficiently large absolute constant C. Then for any RQ Ckd -harmonic 
Lipschitz functions u\, . . . , u k ■ G — > R, we have 

Vo\ Rl -4u 1: ...,u k )<R^ mkd . 



Now we can prove Theorem 7.1. We first observe that it suffices to prove
Theorem 7.1 for $R_0^{-K}$-harmonic Lipschitz functions which vanish at the identity,
since the general case can then be handled by adding the constant functions to
$V$ (increasing the dimension by one). By Corollary 7.6 and the hypothesis on
Rq, we can find (if C is large enough) a threshold fc = 0(\S\)°^ 1+d ^ ' / K ^ such 
that Vo1^i-k(ui, . . . , u kn ) < Rq 100 f or a n jj- ^-harmonic Lipschitz functions 
til, ... , Uk '■ G — > R. Using the greedy algorithm, one may then find R ^-harmonic
Lipschitz functions tii, . . . , Uk '■ G — > R for some < k < ko such that 

Vol fl i- 1 .(ui,...,u*,u) < i?o lood Vol /? i- K ( Ul ,..., Ufc ) 

for all Rq ^-harmonic Lipschitz functions u : G — > R. But if we let V be the space 
spanned by u\ 1 . . . , u^, then from the base times height formula we have 

Voljji-* (tii, • • • , u k , u) = distQ Rl _ K (it, V) Vol H i-« (tti, . . . , u k ) 
and the claim follows. 

8. Third step for Proposition 5.2: establishing an approximate
isometric representation

The group $G$ acts on the space $\mathbb{R}^G$ of functions $u : G \to \mathbb{R}$ by left translation, thus

$\rho(g)(u)(x) := u(g^{-1}x)$

for all $u \in \mathbb{R}^G$, $g \in G$, $x \in G$. Observe that this action preserves the vector space $V$ of Lipschitz harmonic functions. In view of Kleiner's theorem, and assuming $G$ has polynomial growth, this gives a finite-dimensional linear representation of $G$; on the quotient space $V/\mathbb{R}$ of $V$ modulo the constant functions, the Lipschitz semi-norm becomes a norm, which is preserved by the group action.

The purpose of this section is to establish an analogous claim for $(R_0, d)$-growth groups rather than groups of polynomial growth. It is convenient to work modulo the constants. Let $\mathbb{R}^G/\mathbb{R}$ be the space of functions from $G$ to $\mathbb{R}$ modulo addition by constants, and let $\pi : \mathbb{R}^G \to \mathbb{R}^G/\mathbb{R}$ be the quotient map. Observe that the action $\rho$ of $G$ on $\mathbb{R}^G$ descends to an action $\overline{\rho}$ on $\mathbb{R}^G/\mathbb{R}$. One can also meaningfully define the concept of an $\varepsilon$-harmonic Lipschitz function $u$ in $\mathbb{R}^G/\mathbb{R}$, since this concept is invariant under addition by constants. We can also define induced $\ell^p(B)$ norms for every finite $B \subset G$ by

$\|u\|_{\ell^p(B)} := \inf\{ \|\tilde u\|_{\ell^p(B)} : \pi(\tilde u) = u \}$

for any $u \in \mathbb{R}^G/\mathbb{R}$. For $p = 2$, this norm is associated with an inner product

$\langle u, v \rangle_{\ell^2(B)} := \sum_{y \in B} \tilde u(y) \tilde v(y)$

where $\tilde u, \tilde v$ are the unique lifts of $u, v$ by $\pi$ that have mean zero on $B$.

We will need a quotiented variant of Theorem 7.1 with an additional "good scale" $R_1$ which is stable under translations.

Proposition 8.1 (Quantitative Kleiner theorem with good scale). Let $0 < \kappa < 0.1$, and let $(G,S)$ be a $(R_0, d)$-growth group with $d \geq 1$ obeying

(30) $R_0 > K := (C|S|)^{Cd^3/\kappa^3}$

for some sufficiently large $C$. Then there exists a finite-dimensional subspace $V$ of $\mathbb{R}^G/\mathbb{R}$ of dimension

$\dim(V) \leq O(|S|)^{O(d^3/\kappa^2)}$

and a scale $R_0^{1-2\kappa} \leq R_1 \leq R_0^{1-\kappa}$ with two properties:

• Every $R_0^{-K}$-harmonic Lipschitz function $u \in \mathbb{R}^G/\mathbb{R}$ lies at a distance at most $R_0^{-99d}$ from $V$ in the $\ell^2(B_S(R_1))$ norm.
• For every $u \in V$, we have

( 31 ) IMU 2 (B S (ili)) < (1 + RQ^lMlt^BsiRt-Ri-*"))- 

Proof. We first make the observation that we may replace pip by the variant 
condition 

(3 2 ) IMI<>2(R C (R, ToV) - ^0 200d |l W 



at the slight cost of tightening the lower bound R\ > i?J~ 2K to i?i > 2R\~ 2k . 
Indeed, suppose that (|32|) held for all u £ V. Then, if we let Q r : V — >• R denote 



the positive definite quadratic form Q r {u) :— \\u\\ 2 2 for each r > 0, we see 

that det Qr 1 / det Qr 1 /2 < (i?2° 0d ) 2dlm ( y ) ( wnere we pi c k some arbitrary fixed basis 
of V with which to compute determinants). As the Q r are increasing in r, we may 
then use the pigeonhole principle (and the bounds on Rq and dim(V)) and find 
Ri/2 < R[ < R\ such that 

detQ R[ < (l + i? K ) 2 detQ fl ,_ fl i-4»; 

diagonalizing the quadratic form Qr^ with respect to Q R , _ r i~a^ we obtain pip 
with i?i replaced by the slightly smaller R[ . 

It remains to find a V and R\ obeying p"2j) . as well as the property about 
V approximating i? ( ^" ic -harmonic Lipschitz functions. From Theorem 17.11 and a 
quotienting by 7r, we see that if we (temporarily) set R\ :— R ~ K , we may find a 
subspace V of R G /R of dimension 0(\S\)°^ /k2) such that every R Q ^-harmonic 
Lipschitz function u € R G /R lies at a distance at most i? ~ 100d from in the 
e 2 (B s {Ri)) norm. _ 

If the property pi?)) holds for all «£ Vo with this choice of Ri, then we are done. 
Otherwise, suppose there exists u € Vq for which 

(33) ll u o||p( Ss ( i?1 /2)) < 200<i H u o|| £ 2( Bs ( fll )) 
Clearly, u is non-zero; we may normalize 

(34) H"o|l i2 (j?s(fll)) = 1- 

Let Vi be the orthogonal complement of uo in Vq, thus Vi has dimension one less 
than Vq- 

Now let u £ R G /R be a i?g~ x -harmonic Lipschitz function. By construction, we 
can find v £ Vq such that 

(35) \\u-v\\ l2{Bs{Ri)) <R^ 0d . 

On the other hand, u has Lipschitz constant at most 1 and thus (since constants 
have been quotiented out) 

Mw(BMm ^ Ri\ B s(Ri)\ 1/2 < Rl +d ' 2 - 

In particular 

« R 1+d/2 

\\ v We 2 (Bs(Ri)) ^ 



and hence by Cauchy-Schwarz and 



(36) (v, mo Wb-7r^t = 0{Rl +d ' 2 ) 
We split v = vi + (v, M o) i2(g s ( fil )) M cb where v\ € V% is the orthogonal projection
of v to Vi. From ([33]), (J3J]), (JMJ) the latter term is small on B s {R\/2): 

II ( U ' Uo )£ 2 (Ss(fli)) U °l^ 2 (Bs(i?^i/2)) - ^0 15 ° d 

(say). From this, (|35[) . and the triangle inequality we conclude that 

1-77 tt II s p-lOOrf , 0-150(2 

Thus, the property that Vq approximates i?^"^-harmonic Lipschitz functions has 
been inherited by V%, at the slight cost of reducing the scale R\ to R\/2 and 
increasing the error of approximation slightly from i?^ 100d to Rq WM + i?^" 150d . 
One can then iterate this process at most dim(Vo) = 0(\S\) 0( - d3 ^ K ' times until we 
find a space V and a scale R\ for which (f32|) is satisfied; the dimension bound and 
the largeness hypothesis on Rq ensures that R\ > Rq~ 2k , and that the total error 
of approximation never exceeds RQ 99d . The claim follows. □ 

Let $\kappa, (G,S), R_0, d, K, V, R_1$ be as in Proposition 8.1 and set $D := \dim(V)$, thus

(37) $D = O(|S|)^{O(d^3/\kappa^2)}$.

In practice, $D$ should be viewed as bounded (especially when compared with the large parameter $R_0$), and so the factors of $D^{O(1)}$ that appear below should be ignored at a first reading.

Let C V be the set of all elements of V which lie within a distance Rq in 
i 2 {Bs{R\)) norm from a i?^"^-harmonic Lipschitz function. This is a symmetric 
convex subset of V with non-empty interior. Applying John's theorem [B], we may 
then find an ellipsoid E C V such that E C £1 C \f~D-E. If we let ei, . . . , en € V and 
Ai, . . . , Xd > be the principal orthonormal directions and radii of this ellipsoid 
with respect to the Hilbert space structure i 2 (Bs{Ri)) on V, we thus see that 

for i = 1, . . . , D, and conversely every element of Jl can be represented in the form 

D 

(38) Y^ tiXiei 

i=l 

for some tx,...,t D = 0(D°W). 

Since il (and hence y/~D • E) contains the ball of radius RQ 99d , we have the lower 
bound 

(39) Ai > RQ 99d /VD 



for all 1 < i < D. Also since R ^-harmonic Lipschitz functions have an £ 2 (Bs(Ri)) 
norm of at most R\\Bs{Ri)\ 1 ^ 2 < R Q d+1 ^ 2 , we have the upper bound 

(40) Xi « R { d+1)/2 

for 1 < i < D. 

Let 1 < i < D and g £ G. By construction, there exists a Rq -harmonic 
Lipschitz function Hi e R G /R with \\ui — Xiei\\p^ Bs ^ Rl ^ < RQ 99d . Translating 
this, we obtain 

\\p(g)Wi- X l -p{g)e l \\ P{g . Bs{Rl)) < Rq 99c1 . 
In particular, if g e B s (Rl 5k ), then

\\p{g)ui - ^iP(9)ei\\ e 2 {Bs{Rl _ R i-^ )) < Ro 99d - 

On the other hand, ~p(g)ui is also a Rq ^-harmonic Lipschitz function, and thus 
must lie within Rq in i 2 (Bs{Ri)) norm of some function f g ^ 6 CI. By the 
triangle inequality we thus have 

(41) \\f g ,i ~ m9)e*\\p { Bs( Rl -Rl-^)) < 2R o" d - 
Using (p8| . we can write 

D 

(42) f g ,i — ^""^ tg,j,i^j e j 

3=1 

for some t ff ,jj obeying the bounds 

(43) - />'" • 

Another bound on these coefficients is obtained by observing that 

\\ X iP(9) e t\\mBs(B. 1 -R 1 - 5K )) - M\ e i\\t 2 (Bs(Ri)) = X i 

and thus (by 0U, O) 

WfgA\i2(Bs(Ri-Rl,- r ° K )) ^ 

and thus (by (|5I])) 

\\fg,ih 2 (B s (Ri)) < £>° (1) Ai. 

Using (1121 and the orthonormal properties of ej we conclude that 

(44) - />" ' A, A,. 

We now investigate the extent to which the $D \times D$ matrices $U_g := (t_{g,j,i})_{1 \leq j,i \leq D}$ behave like a representation. From construction we see that we may take $U_{\mathrm{id}} = I$, where $I$ is the $D \times D$ identity matrix. Now we look at the multiplicativity.

Proposition 8.2 (U g approximately multiplicative) . If g,h £ £?s(i?Q~ 5K /2) and 
1 < i, k < D, then the (k, i) entry of the matrix U g h — U g Uh, i.e. 

d 
3=1 

has magnitude 0(D ^ R^ 99d / \ k ) . 
Proof. From (gTJl, dHJ) we have 

D 

II *^2th, 3 ,i^jej - ^iP(h)ei\\ e2(Bs ( Ri _ R i-5K^ < 2R 99d 

3=1 

for all 1 < i < D; applying ~p(g) to this, we conclude 

D 

X/'<-.-< A .. 7^ .'/".; - X iP(9h)ei\\ lHBs(Rl _ 2R i-5 K)) < 2i?g 99d . 

3=1 
Meanwhile, from (41), (42) we have

D 

ll A jP(5) e j - ^2 t 9,k,j^ke k \\ e 2 {Bs{Rl _ 2R i-^ )) < 2i?g 99d 
fe=i 

and thus by the triangle inequality we have 

D D 

\\\{p(gh)ei - ^^t h . hl tg. k , ] \ k e k \\p2 {Bs{Rl ^2R 1 - 5K )) < D ^ R^ 99d . 
fc=l 3 = 1 

Meanwhile, from one final application of fiTj) . (|4"2")l we have 

D 

\\\{p{gh)ei - ^2tgh,kAkZk\\i,2i yBs{Rl ^2Rl-^)) < D° {1) R~ md 
fc=i 

and so by the triangle inequality 

D d 

|| ^[Vi,fc,i — i 'iJ,^9:*-'j] A fe e fellf 2 (Bs(_R. 1 -2fl ( l ) _5 ' t )) ^ D O(1 ^R " d 
fc=l j=l 

and thus by (I5TT) 

$\|\sum_{k=1}^{D} [t_{gh,k,i} - \sum_{j=1}^{D} t_{h,j,i} t_{g,k,j}] \lambda_k e_k\|_{\ell^2(B_S(R_1))} \le D^{O(1)} R_0^{-99d}$.

As the $e_k$ are orthonormal, the claim follows. □

9. Fourth step for Proposition 5.2: Taking commutators

We continue the discussion in the previous section. To simplify the expressions 
slightly we will make the smallness assumption 

d log | SI 

on $\kappa$, and in particular from (37)

(46) D<2°^° (1, \ 

For reasons that will be clearer later, we will also need to make Ro larger than 
previously assumed, in particular we assume that 

(47) Ro > 2 2 ° /kC 

for some sufficiently large absolute constant $C$. (In particular, quantities such as
$R_0^{\kappa^{10}/2D^2}$ are still quite large.)

The next step is to locate a large set of group elements $g \in G$ (which will be
commutators of other group elements) for which the $U_g$ are very close to the identity
matrix $I$. The key point here is that if $U_g$, $U_h$ are within $\varepsilon$ of $I$ for some small
$\varepsilon > 0$ (in some suitable matrix norm), then the commutator $[U_g, U_h]$ is within
$O(D^{O(1)} \varepsilon^2)$ of $I$. Meanwhile, from Proposition 8.2, we expect $[U_g, U_h] \approx U_{[g,h]}$. The strategy
here can be viewed as a simplified variant of the argument used to prove the Solovay-
Kitaev theorem [26].
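
The quadratic gain for commutators can be checked by hand on a toy case. The following is a minimal sketch, not taken from the paper: it uses two hypothetical $2 \times 2$ test matrices, each exactly $\varepsilon$ away from the identity in the entrywise sup norm, and verifies in exact rational arithmetic that their group commutator is within $2\varepsilon^2$ of the identity. The function names and the choice of norm are assumptions made for illustration only.

```python
# Illustrative sketch only (not the paper's construction): exact check that the
# group commutator of two matrices within eps of I is within O(eps^2) of I.
from fractions import Fraction


def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def inverse_2x2(m):
    """Inverse of an invertible 2x2 matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def dist_from_identity(m):
    """Largest entrywise deviation from the identity matrix."""
    n = len(m)
    return max(abs(m[i][j] - (1 if i == j else 0))
               for i in range(n) for j in range(n))


def commutator(u, v):
    """Group commutator u v u^{-1} v^{-1}."""
    return mat_mul(mat_mul(u, v), mat_mul(inverse_2x2(u), inverse_2x2(v)))


def example(eps):
    """Two hypothetical test matrices, each exactly eps away from the identity."""
    u = [[Fraction(1), eps], [Fraction(0), Fraction(1)]]
    v = [[Fraction(1), Fraction(0)], [eps, Fraction(1)]]
    return u, v


for eps in (Fraction(1, 10), Fraction(1, 100), Fraction(1, 1000)):
    u, v = example(eps)
    gap = dist_from_identity(commutator(u, v))
    print(eps, gap, gap <= 2 * eps**2)
```

For this particular pair the deviation works out to $\varepsilon^2 + \varepsilon^4$, so squaring the closeness at each commutator step is visible directly.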
We turn to the details. Let us write $O_\lambda(X)$ to denote any matrix whose $(k,i)$ entry
is $O(X/\lambda_k)$, and $O_\lambda^\times(X)$ to denote any matrix whose $(k,i)$ entry is $O(X \min(1, \lambda_i/\lambda_k))$.
Then we can rewrite the conclusion of Proposition 8.2 as

(48) $U_{gh} = U_g U_h + O_\lambda(D^{O(1)} R_0^{-99d})$

and rewrite (43), (44) as

(49) $U_g = O_\lambda^\times(D^{O(1)})$.

We also observe the multiplication laws

(50) $O_\lambda^\times(X) O_\lambda^\times(Y) = O_\lambda^\times(D^{O(1)} XY)$; $O_\lambda^\times(X) O_\lambda(Y),\ O_\lambda(X) O_\lambda^\times(Y) = O_\lambda(D^{O(1)} XY)$.

In particular we have

(51) $U_g U_{g^{-1}},\ U_{g^{-1}} U_g = I + O_\lambda(D^{O(1)} R_0^{-99d})$

for all $g \in B_S(R_1^{1-5\kappa}/2)$.

We have the following fundamental fact: 

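Lemma 9.1 (Commutator bound). If $g, e, e' \in B_S(R_1^{1-5\kappa}/100)$ and $0 < \varepsilon < 1$ are
such that

(52) $U_e, U_{e'} = I + O_\lambda^\times(\varepsilon) + O_\lambda(X)$,

for some $R_0^{-99d} \le X \le R_0^{-98d}$, then

$U_{g[e,e']g^{-1}} = I + O_\lambda^\times(D^{O(1)} \varepsilon^2) + O_\lambda(D^{O(1)} X)$.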
Proof. From (48), (49), (50) we have

$U_{[e,e']} = U_e U_{e'} U_{e^{-1}} U_{(e')^{-1}} + O_\lambda(D^{O(1)} X)$.

Splitting $U_{e'} = (U_{e'} - I) + I$ and using (51), (49), (50) we conclude that

(53) $U_{[e,e']} = U_e (U_{e'} - I) U_{e^{-1}} U_{(e')^{-1}} + U_{(e')^{-1}} + O_\lambda(D^{O(1)} X)$.

From (52), (50) we have

$(U_e - I)(U_{e'} - I),\ (U_{e'} - I)(U_e - I) = O_\lambda^\times(D^{O(1)} \varepsilon^2) + O_\lambda(D^{O(1)} X)$

and thus

$U_e (U_{e'} - I) = (U_{e'} - I) U_e + O_\lambda^\times(D^{O(1)} \varepsilon^2) + O_\lambda(D^{O(1)} X)$.

Inserting this into (53) and using (48), (49), (50) we conclude that

$U_{[e,e']} = (U_{e'} - I) U_{(e')^{-1}} + U_{(e')^{-1}} + O_\lambda^\times(D^{O(1)} \varepsilon^2) + O_\lambda(D^{O(1)} X)$.

Applying (51) again we obtain

$U_{[e,e']} = I + O_\lambda^\times(D^{O(1)} \varepsilon^2) + O_\lambda(D^{O(1)} X)$.

If we multiply this on the left by $U_g$ and on the right by $U_{g^{-1}}$ and use (51), (48),
(50) we obtain the claim. □

We would like to iterate this bound to find many $g$ with $U_g$ very close (e.g.
$O(R_0^{-50d})$) to $I$, but to get started we will need to locate a preliminary supply of $g$
for which $U_g$ is somewhat close (e.g. $O(R_0^{-\kappa^{10}/2D^2})$) to $I$. Morally, this should follow
from the Dirichlet box principle (i.e. the pigeonhole principle) since the $U_g$ (and
$U_{g^{-1}}$) are morally localized to a compact set of matrices (thanks to (49)) and are
approximately multiplicative. We now make this intuition precise.
Lemma 9.2 (Box principle). There exists a subgroup $(G',S')$ of $G$ with $S' \subset
B_S(R_0^{\kappa^{10}})$ and finite index $|G : G'| \le R_0^{\kappa^{10}}$ such that $U_e = I + O_\lambda^\times(D^{O(1)} R_0^{-\kappa^{10}/2D^2}) +
O_\lambda(D^{O(1)} R_0^{-99d})$ for all $e \in S'$.

Proof. From (49), the matrices $U_g$ for $g \in B_S(R_1^{1-5\kappa}/2)$ are contained in a set of
matrices of the form $\{U : U = O_\lambda^\times(D^{O(1)})\}$. From the convexity of the conditions
used in the $O_\lambda^\times()$ notation (and the largeness hypothesis on $R_0$), we may cover this
$D^2$-dimensional set by $M \le R_0^{\kappa^{10}}$ balls $B_1, \ldots, B_M$ of the form $B_m = \{U : U =
U_m + O_\lambda^\times(R_0^{-\kappa^{10}/2D^2})\}$ for some $U_m = O_\lambda^\times(D^{O(1)})$. (Here we are using (37) to clean
up the bounds somewhat.)

For each $r \ge 0$, let $A_r \subset \{1, \ldots, M\}$ be the set of those $1 \le m \le M$ such
that $U_g \in B_m$ for some $g \in B_S(r)$. Clearly the $A_r$ are increasing in $r$, so by the
pigeonhole principle there exists $1 \le r \le M$ such that $A_{r+1} = A_r$.

Fix this $r$. For each $m \in A_r$, let $g_m \in B_S(r)$ be a representative such that
$U_{g_m} \in B_m$. Since $A_{r+1} = A_r$, we see that for each $g \in B_S(r+1)$ there exists
$m \in A_r$ such that $U_g \in B_m$, and in particular

$U_g = U_{g_m} + O_\lambda^\times(R_0^{-\kappa^{10}/2D^2})$.

Multiplying by $U_{g_m^{-1}}$ on the right and using (51), (49), (50), we see that

$U_{g g_m^{-1}} = I + O_\lambda^\times(D^{O(1)} R_0^{-\kappa^{10}/2D^2}) + O_\lambda(D^{O(1)} R_0^{-99d})$

and similarly

$U_{g_m g^{-1}} = I + O_\lambda^\times(D^{O(1)} R_0^{-\kappa^{10}/2D^2}) + O_\lambda(D^{O(1)} R_0^{-99d})$.

Let $S'$ denote the set of all $g g_m^{-1}$, $g_m g^{-1}$ that arise in this manner, then $S' \subset
B_S(2r+1)$ is symmetric and

$B_S(r+1) \subset S' \cdot \{g_1, \ldots, g_M\} \subset S' \cdot B_S(r)$.

Iterating this we see that

$B_S(r+n) \subset B_{S'}(n) \cdot B_S(r) \subset B_{S'}(n+1) \cdot \{g_1, \ldots, g_M\}$

for all $n$; thus if $G'$ denotes the group generated by $S'$, then on taking unions as
$n \to \infty$ we conclude that

$G \subset G' \cdot \{g_1, \ldots, g_M\}$.

Thus $G'$ has index at most $M$, and the claim follows. □
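
The stabilisation step $A_{r+1} = A_r$ in the proof above is a plain pigeonhole argument on an increasing chain of non-empty subsets of an $M$-element set. As a minimal sketch under toy assumptions (the integers with generators $\pm 1$ stand in for $G$, and residues mod 4 stand in for the $M$ covering balls; none of this is the paper's actual data), one can search for the first stable radius as follows.

```python
# Toy illustration of the pigeonhole step: an increasing chain of non-empty
# subsets of an M-element set must satisfy A_{r+1} == A_r for some 1 <= r <= M.

def first_stable_radius(bucket_of, ball, M):
    """Least r in 1..M with A_{r+1} == A_r, where A_r = buckets hit by ball(r).

    Assumes ball(r) is non-empty and increasing in r, and that bucket_of takes
    at most M distinct values (hypothetical stand-ins for B_S(r) and the B_m).
    """
    for r in range(1, M + 1):
        a_r = {bucket_of(g) for g in ball(r)}
        a_next = {bucket_of(g) for g in ball(r + 1)}
        if a_next == a_r:
            return r
    raise AssertionError("unreachable under the stated assumptions")


def integer_ball(r):
    """Ball of radius r in the integers with generators +1 and -1."""
    return range(-r, r + 1)


def residue_mod_4(g):
    """Hypothetical bucket map with M = 4 buckets."""
    return g % 4


print(first_stable_radius(residue_mod_4, integer_ball, 4))
```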

Write $\varepsilon := R_0^{-\kappa^{10}/2D^2}$, thus

(54) $U_e = I + O_\lambda^\times(D^{O(1)} \varepsilon) + O_\lambda(D^{O(1)} R_0^{-99d})$

for all $e \in S'$.

Since $(G,S)$ is a $(R_0,d)$-growth group and $S' \subset B_S(R_0^{\kappa^{10}})$, we see that $(G',S')$
is a $(R_0^{1-\kappa^{10}}, d/(1-\kappa^{10}))$-growth group. Applying Lemma 4.7 we see that the
commutator group $(G')^{(2)} := [G',G']$ can be generated by a set $(S')^{(2)}$ of generators
in $B_{S'}(R_0^{\kappa^{10}})$ of the form $g[e,e']g^{-1}$ for some $e, e' \in S'$ and $g \in B_{S'}(R_0^{\kappa^{10}})$, and
furthermore $((G')^{(2)}, (S')^{(2)})$ is a $(R_0^{1-2\kappa^{10}}, d/(1-2\kappa^{10}))$-growth group. From Lemma
9.1 we conclude that

$U_e = I + O_\lambda^\times(D^{O(1)} \varepsilon^2) + O_\lambda(D^{O(1)} R_0^{-99d})$

for all $e \in (S')^{(2)}$, where the $O(1)$ exponents are larger than those in (54) by a
multiplicative absolute constant.
Let $l$ be the first integer such that

(55) $\varepsilon^{2^l} \le R_0^{-100d}$

or equivalently

$2^l (\kappa^{10}/2D^2) \ge 100d$,

thus by (37), (45)

(56) $l(1 + \log|S|) \ll \kappa^{-6}$.
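
To make the choice of $l$ concrete, here is a small sketch that computes the first $l$ with $2^l \kappa^{10}/(2D^2) \ge 100d$ in exact arithmetic. The function name and the sample values of $\kappa$, $D$, $d$ are hypothetical and chosen only for illustration; in the paper $D$ is far larger and $\kappa$ is constrained by the standing smallness assumption.

```python
# Sketch: number of squaring steps needed so that eps^(2^l) <= R_0^(-100 d),
# where eps = R_0^(-kappa^10 / (2 D^2)). Sample inputs are hypothetical.
from fractions import Fraction


def squaring_depth(kappa, D, d):
    """Smallest integer l >= 0 with 2**l * kappa**10 / (2 * D**2) >= 100 * d."""
    target = Fraction(200 * d * D**2) / Fraction(kappa) ** 10  # need 2**l >= target
    l = 0
    while 2**l < target:
        l += 1
    return l


print(squaring_depth(Fraction(1, 10), 2, 1))
```

Since $l$ grows only logarithmically in $D^2 d/\kappa^{10}$, the doubly exponential constants $D^{2^{O(l)}}$ that appear after $l$ iterations remain controlled by the lower bound on $R_0$.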



We can iterate the above procedure $l$ times and conclude that the $l$-th group $(G')^{(l)}$
in the derived series of $G'$ is generated by a set $(S')^{(l)} \subset B_{S'}(R_0^{\kappa})$ with the property
that

$U_e = I + O_\lambda^\times(D^{2^{O(l)}} \varepsilon^{2^l}) + O_\lambda(D^{2^{O(l)}} R_0^{-99d})$
for all e <= (S') (0 - By (gU), (H), (0DJ, ([55]) we may clean this up as 

$U_e = I + O_\lambda(R_0^{-90d})$

(say). From (41), (42) we conclude that

$\|\rho(e)(\lambda_i e_i) - \lambda_i e_i\|_{\ell^2(B_S(R_1))} \le R_0^{-80d}$

(say) for all $1 \le i \le D$ and $e \in (S')^{(l)}$, which by (38) implies that

$\|\rho(e)f - f\|_{\ell^2(B_S(R_1))} \ll R_0^{-70d}$

(say) for all $f \in \Omega$ and $e \in (S')^{(l)}$. In particular, for any $R_0^{-K}$-harmonic Lipschitz
function $u$, we see from the triangle inequality that

$\|\rho(e)u - u\|_{\ell^2(B_S(R_1))} \ll R_0^{-60d}$

(say) for all $e \in (S')^{(l)}$. Unpacking the definition of the $\ell^2(B_S(R_1))$ norm, and
noting that $B_S(R_1)$ has cardinality at most $R_0^d$, we conclude that

$u(eg) - u(eh) = u(g) - u(h) + O(R_0^{-50d})$

(say) for all $g,h \in B_S(R_1)$ and $e \in (S')^{(l)}$ (here we use the symmetry of $(S')^{(l)}$). In
particular, for $e,e' \in (S')^{(l)}$ and $g \in B_S(R_1/2)$ we have

$u(ee'g) = u(eg) + u(e'g) - u(g) + O(R_0^{-50d})$.

Reversing $e$ and $e'$ and subtracting we conclude that

$u(ee'g) = u(e'eg) + O(R_0^{-50d})$,

and thus

$u([e,e']g) = u(g) + O(R_0^{-50d})$,

for $e,e' \in (S')^{(l)}$ and $g \in B_S(R_1/4)$, which implies (after replacing $g$ by $g^{-1}h$, and
$u(\cdot)$ by $u(g \cdot)$) that

(57) $u(g[e,e']g^{-1}h) = u(h) + O(R_0^{-50d})$

for all $g,h \in B_S(R_1/8)$ and $e,e' \in (S')^{(l)}$.

By invoking Lemma 4.7 one last time, one can find a set $(S')^{(l+1)}$ of generators
of $(G')^{(l+1)}$ in $B_{S'}(R_0^{\kappa})$ of the form $g[e,e']g^{-1}$ for some $e, e' \in (S')^{(l)}$ and $g \in
B_S(R_0^{\kappa})$, and so from (57) we have concluded the following:
Theorem 9.3 (Many trivial directions for harmonic Lipschitz functions). Let $0 <
\kappa < 0.1$, $R_0, d \ge 1$, and let $G$ be a $(R_0,d)$-growth group. Assume the bounds (45),
(|47[) for some sufficiently large C . Then there exists a {Rq K , d/(l — n 10 ))-growth 
subgroup (C , S') of G of index at most Rq and a positive integer I — 0(k~ 6 ) such 
that (G') ( ' +1) is generated by a set (S") (i+1) C B S ,(R% K ) obeying the bound 

(58) $u(ex) = u(x) + O(R_0^{-50d})$

for all e G (S'p l+1 ', x G Bs(R\~^ K ), and all Rq K -harmonic Lipschitz functions 
u : G — > R, where K is defined by (|30[) . 

Remark 9.4. An instructive example here is that of the Heisenberg group
$G = \begin{pmatrix} 1 & \mathbf{Z} & \mathbf{Z} \\ 0 & 1 & \mathbf{Z} \\ 0 & 0 & 1 \end{pmatrix}$, with the elementary row operations as generators. One can show
that the only harmonic Lipschitz functions $u : G \to \mathbf{R}$ are those functions which
are affine-linear combinations of the near-diagonal coefficients $x, z$ of the group
element $\begin{pmatrix} 1 & x & y \\ 0 & 1 & z \\ 0 & 0 & 1 \end{pmatrix}$, and in particular such functions are invariant with respect to
the vertical element $e := \begin{pmatrix} 1 & 0 & 1 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$. One way to see this is to use the harmonicity
to observe the reproducing formula $u = u * \sigma^{(m)}$ for any $m$, where $\sigma^{(m)}$ are the
random walk distributions defined in Proposition 6.3. As $e$ is central, one then has
$\partial_e u = u * \partial_e \sigma^{(m)}$, where $\partial_e f(x) := f(ex) - f(x)$. But a computation shows that the
total variation of $\partial_e \sigma^{(m)}$ is $O(1/m^2)$ (the intuition here is that $\sigma^{(m)}$ behaves like
uniform probability measure on a box of dimensions $O(\sqrt{m})$ in the $x, z$ directions
and $O(m)$ in the $y$ directions); in contrast, $u$, being Lipschitz, can only fluctuate
by at most $O(m)$ on the bulk of the support of $\sigma^{(m)}$. Estimating things carefully
and taking limits as $m \to \infty$ we conclude that $\partial_e u = 0$, at which point it is easy
to verify the claim. This example illustrates the general phenomenon, established
in our companion paper [24] (using a different method), that Lipschitz harmonic
functions on nilpotent groups vanish along iterated commutator directions; in fact
modulo the constants they are exactly the additive group characters.
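
The algebraic part of this remark is easy to verify mechanically. The sketch below encodes a Heisenberg matrix by its coordinates $(x, y, z)$ and checks that the vertical element is the commutator of the two elementary generators, that it is central, and that any affine function of $x$ and $z$ is invariant under it and has the mean value property. The coordinate encoding, the generating set of four elements, and the use of right multiplication in the harmonicity check are assumptions made for this illustration; the analytic claim that these are the only harmonic Lipschitz functions is not tested here.

```python
# Sketch: the integer Heisenberg group in coordinates (x, y, z), standing for
# the upper unitriangular matrix [[1, x, y], [0, 1, z], [0, 0, 1]].

def mul(g, h):
    """Matrix product expressed in (x, y, z) coordinates."""
    x1, y1, z1 = g
    x2, y2, z2 = h
    return (x1 + x2, y1 + y2 + x1 * z2, z1 + z2)


def inv(g):
    """Inverse element in (x, y, z) coordinates."""
    x, y, z = g
    return (-x, -y + x * z, -z)


def commutator(g, h):
    """Group commutator g h g^{-1} h^{-1}."""
    return mul(mul(g, h), mul(inv(g), inv(h)))


A = (1, 0, 0)          # elementary generator in the x direction
B = (0, 0, 1)          # elementary generator in the z direction
E = (0, 1, 0)          # the vertical (central) element
GENERATORS = [A, inv(A), B, inv(B)]   # hypothetical symmetric generating set


def affine(a, b, c):
    """Affine-linear function of the near-diagonal coordinates x and z."""
    return lambda g: a * g[0] + b * g[2] + c


def is_harmonic_at(u, g):
    """Mean value property at g with respect to right multiplication by GENERATORS."""
    return sum(u(mul(g, s)) for s in GENERATORS) == len(GENERATORS) * u(g)


u = affine(3, -2, 5)
g = (4, -7, 2)
print(commutator(A, B) == E, mul(E, g) == mul(g, E), u(mul(E, g)) == u(g), is_harmonic_at(u, g))
```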



10. Final step for Proposition 5.2: non-trivial harmonic Lipschitz
functions have large range

An easy application of the maximum principle shows that any non-constant
harmonic function $u : G \to \mathbf{R}$ must attain an infinite number of values. We thus
expect any $\varepsilon$-harmonic Lipschitz function $u : G \to \mathbf{R}$ obeying some non-degeneracy
condition (e.g. a lower bound on $|\nabla u(\mathrm{id})|$) to also take on a large number of values
in any given ball $B_S(R)$.

In fact we will need a stronger result (under a polynomial growth hypothesis),
which asserts that a non-degenerate $\varepsilon$-harmonic function must in fact fluctuate by
$\gtrsim R$ on the ball $B_S(R)$:

Proposition 10.1 (Lower bound on range). Let $(G,S)$ be a $(R_0,d)$-growth group
for some $R_0, d \ge 1$, let $0 < \kappa < 0.1$, and suppose that (12) holds for some sufficiently
large absolute constant $C$. Let $u : G \to \mathbf{R}$ be an $R_0^{-K}$-harmonic function such that
$|\nabla u(\mathrm{id})| \ge 1/|S|$, where $K$ was defined in (12). Then for every $R_0^{5\kappa} \le R \le R_0^{1-2\kappa}$
we have

(59) $\sup_{x \in B_S(R)} |u(x) - u(\mathrm{id})| \gg O(|S|)^{-O(d^3/\kappa^2)} R_0^{-\kappa} R$.

Proof. Our main tool here will be the quantitative Kleiner theorem (Theorem 7.1).
This theorem gives us a space $V \subset \mathbf{R}^G$ of dimension

$D := \dim(V) = O(|S|)^{O(d^3/\kappa^2)}$

such that for every $g \in G$, there exists $v_g \in V$ such that

\\p(9)n-v g \\ e2(Bs{Rl - n) <i? - 100d 

and in particular 

(60) \\ P (g)Vu - Vv g \\ eHBs(Rlr)) < \S\R- 100d . 

The balls $B_S(R')$ for $R_0^{-\kappa} R \le R' \le R$ are increasing in $R'$, and have cardinality
between 1 and $R_0^d$. By the pigeonhole principle, one may thus find a radius
$2R_0^{-\kappa} R \le R' \le R$ such that

(61) $|B_S(R')| \le O(1)^{d/\kappa} |B_S(R'/10)|$.

Fix this $R'$. By subtracting off a constant, we may assume $u(\mathrm{id}) = 0$. Write
$a := R' R_0^{-K} + (R')^{-1} \sup_{x \in B_S(R')} |u(x)|$, then

$\sum_{x \in B_S(R')} |u(x)|^2 \le (R')^2 a^2 |B_S(R')|$.

Applying Proposition 7.3 we conclude that

(62) $\|\nabla u\|_{\ell^2(B_S(R'/2))} \ll |S|^{O(1)} a |B_S(R')|^{1/2}$.

Now, let $g_1, \ldots, g_{D+1}$ be chosen uniformly at random from $B_S(R'/10)$. From
(61), (62), and Chebyshev's inequality we see that for each distinct $1 \le i,j \le D+1$,
we have

(63) \p( gi )V(u)( 9j )\ < 

with probability at least $1 - O(|S|)^{O(d^3/\kappa^2)} a^2$. By the union bound, we thus have
(63) for all distinct $1 \le i,j \le D+1$ with probability at least $1 - O(|S|)^{O(d^3/\kappa^2)} a^2$.
Meanwhile, for $i = j$ we have

$|\rho(g_i)\nabla(u)(g_j)| \ge 1/|S|$
by hypothesis. Applying (|63|) we see that 

Mv gi )( 9j )\ < 

for $i \ne j$ and

$|\nabla(v_{g_i})(g_j)| \ge \frac{1}{2|S|}$

for $i = j$. The matrix $(\nabla(v_{g_i})(g_j))_{1 \le i,j \le D+1}$ is then diagonally dominant and
thus invertible; however, the $v_{g_1}, \ldots, v_{g_{D+1}}$ lie in a $D$-dimensional space and thus
must have a linear dependence. This leads to a contradiction unless the stated
event occurs with zero probability; this forces $a \gg O(|S|)^{-O(d^3/\kappa^2)}$ and the claim
follows. □
We can combine this with Theorem 9.3 and Proposition 6.3 to obtain a crucial
reduction in growth order, from $d$ to approximately $d - 1$:

Corollary 10.2 (Reduction in growth order). Let $0 < \kappa < 0.1$, $R_0, d \ge 1$, and let
$(G,S)$ be a $(R_0,d)$-growth group. Assume the bounds (45), (47) for some sufficiently
large $C$. Then at least one of the following holds:

• G = B s (i?o Xp(cxp( ^ 0<1,)) ). 

• There exists a (Rq~ k >d/(l — n 10 ))-growth subgroup (G',S") of G of in- 
dex at most Rq and a positive integer I = 0(k~ 6 ) such that (G')^ +1 ^ is 
generated by a set C Bs'(i^ K ) for which ((G') ( ' +1) , is a 
(i?o~ 4K , (d- 1 + 6k)/(1 - An))-growth group. 

Proof. Applying Proposition 6.3 we see that either the first conclusion holds, or
there exists an $R_0^{-K}$-harmonic Lipschitz function $u$ with $|\nabla u(\mathrm{id})| \ge 1/|S|$, where $K$ is
defined by (12). Applying Theorem 9.3 we can ensure that (58) holds for this value
of $u$; iterating this we see in particular that

(64) $u(ex) = u(x) + O(R_0^{-40d})$

for all $e \in B_{(S')^{(l+1)}}(R_0^{1-4\kappa})$, $x \in B_S(R_0^{1-4\kappa})$.

By Proposition 10.1 (and cleaning up the constants) we have

$\sup_{x \in B_S(R_0^{1-4\kappa})} |u(x) - u(\mathrm{id})| \ge 10 R_0^{1-6\kappa}$

and thus there exists a path of length at most $R_0^{1-4\kappa}$ starting at the origin on
which $u$ fluctuates by at least $10 R_0^{1-6\kappa}$. Since $u$ also has Lipschitz constant at
most 1, we can thus find $M \ge R_0^{1-6\kappa}$ points $x_1, \ldots, x_M \in B_S(R_0^{1-4\kappa})$ such that
$u(x_1), \ldots, u(x_M)$ all differ by at least 1. Combining this with (64) we see that the
sets $B_{(S')^{(l+1)}}(R_0^{1-4\kappa}) \cdot x_m$ for $1 \le m \le M$ are disjoint. But these sets all lie in
$B_S(R_0)$, which has cardinality at most $R_0^d$. We conclude that

$|B_{(S')^{(l+1)}}(R_0^{1-4\kappa})| \le R_0^d/M \le R_0^{d-1+6\kappa}$

and the claim follows. □

Finally, we can prove Proposition 5.2.

Proof of Proposition 1 5. ,21 Applying Lemma 14.21 with k = ^ for some sufficiently 
small absolute constant c > 0, one can find a (Rq , i?o^ d )-subgroup (G, S) of G 

1 _ 2 

which is a (i?o 100d ,d - 0.01)-growth group and with \S\ < 0(l) d and G = 

G ■ B-siR^ ). We then apply Corollary [TOJ to (G, S 1 ) with « := J, for some suffi- 
ciently small absolute constant c > 0. If the first conclusion of Corollary [TCh!] holds, 

then the properties relating (G, S) to (G, S) imply that G = B 5 (i?o Xp(oxp(0(d)0<1))) ). 
If the second conclusion of Corollarv llO. 21 holds, then the claim follows after substi- 
tuting in the value of k (and replacing I by I + 1), and using Lemma 14. II (Observe 
from Lemma 14.81 that G' is a (Rq ,Rq , l)-subgroup of G.) □ 






11. First step for Proposition 5.3: reduction to a cyclic base

We now begin the proof of Proposition 5.3. In this section we execute the first
step of this proof, which is to reduce to the case when the base group is cyclic
rather than solvable.

By replacing S with S U Sh , replacing Rq by i?g^ 10 , and adjusting d and G 
appropriately, we may assume that $S_H \subset S$ in Proposition 5.3; thus $(H, S_H)$ is now
a $(\infty, 1)$-subgroup of $(G, S)$.

We now claim that Proposition 5.3 follows from the $l = 1$ case of this proposition,
and specifically from

Proposition 11.1 (Semi-quantitative Milnor-Wolf theorem, first reduction). Let
$r, R_0, d \ge 1$, and suppose that

R > C d r c 

for some sufficiently large absolute constant $C$. Suppose we have a short exact
sequence

$0 \to H \to G \to A$

of groups, where $G = (G,S)$ is a $(R_0,d)$-growth group, $(H,S_H)$ is a virtually
$r$-nilpotent $(\infty,1)$-subgroup of $(G,S)$, and $A$ is abelian. Then $(G,S)$ is virtually
$r + O(1)^d$-nilpotent.

To see how Proposition 11.1 implies Proposition 5.3 we induct on $l$. The case
$l = 1$ already follows from Proposition 11.1, so suppose that $l \ge 2$ and the claim
has already been proven for $l - 1$.

Let $\kappa := \frac{1}{100dl}$, then by Lemma 4.7 we may find a set of generators $S' \subset B_S(R_0^{\kappa}) \cap
[G,G]$ for $[G,G]$. If we then set $\tilde G := \langle [G,G], H \rangle$ and $\tilde S := S_H \cup S' \subset B_S(R_0^{\kappa})$, we
see that $(\tilde G, \tilde S)$ is a $(R_0^{1-\kappa}, d/(1-\kappa))$-growth group, and we have the short exact
sequence

$0 \to H \to \tilde G \to [N,N] \to 0$.

Of course, $[N,N]$ is solvable of derived length at most $l - 1$, so by induction hypothesis
$(\tilde G, \tilde S)$ is virtually $r + (l-1)O(1)^d$-nilpotent. (Note that even after iterating
this induction hypothesis $l$ times, the order of growth $d$ of $G$ does not increase
significantly, thanks to the choice of $\kappa$, so for the purposes of computing quantitative
bounds one can treat $d$ as constant throughout this iteration.) On the other hand,
$G/\tilde G$ is abelian. If we then apply Proposition 11.1 to the short exact sequence

$0 \to \tilde G \to G \to G/\tilde G \to 0$

we obtain the claim (for $C$ large enough).

Now suppose we are in the situation of Proposition 11.1. At present, the abelian
group $(A, S_A)$ could have many generators; but we can cut down the number of
generators to a quantity depending on $d$. Indeed, as $G = (G,S)$ is an $(R_0,d)$-growth
group, we see that $(A, S_A)$ is also an $(R_0,d)$-growth group, where $S_A$ is the
projection of $S$ to $A$. Applying Lemma 4.2 we may find a $(R_0^{0.9}, d/0.9)$-growth
(-Rg S-Rg-^-subgroup {A',S A >) of (A, S) with \S A >\ < 0(l) d . Let G' be the preim- 
age of A' in G, and let S' := ^(flg- 1 ) n A', then (G',S') is a \ R° Q ^-subgroup 
of $(G, S)$. We may thus replace $G, A, S$ by $G', A', S'$ in Proposition 11.1 (adjusting
$d$ and $C$ slightly), thus allowing us to reduce to the case when $A$ is generated by at
most $m = O(1)^d$ elements $a_1, \ldots, a_m$, and furthermore we may assume that these
elements are contained in the projection of $S$ to $A$.
As $(H, S_H)$ is a $(\infty, 1)$-subgroup of $(G, S)$, $S_H$ is contained in $S$. We can then
discard all elements of $S$ other than those in $S_H$ and those that are projecting to
$A$, and thus assume that $S$ takes the form

$S = S_H \cup \{e_1, \ldots, e_m, e_1^{-1}, \ldots, e_m^{-1}\}$

where $e_1, \ldots, e_m \in G$ project to the generators $a_1, \ldots, a_m$ of $A$.

An easy induction on $m$ then allows us to reduce to the one-dimensional case
$m = 1$, and specifically to
Proposition 11.2 (Semi-quantitative Milnor-Wolf theorem, second reduction).
Let $r, R_0, d \ge 1$, and suppose that

Ro > C d r c 

for some sufficiently large absolute constant $C$. Suppose we have a short exact
sequence

$0 \to H \to G \to A$

of groups, where $(G,S)$ is a $(R_0,d)$-growth group, $(H,S_H)$ is a virtually $r$-nilpotent
subgroup, and $A$ is cyclic. Suppose also that $S = S_H \cup \{e, e^{-1}\}$ for some $e \in G$.
Then $G$ is virtually $r+1$-nilpotent.

It remains to establish Proposition 11.2. This will be accomplished in Section
14 after some preliminaries in Sections 12, 13.

12. Second step for Proposition 5.3: slow growth of iterated
conjugation

Suppose we are in the situation of Proposition 11.2. The short exact sequence
forces $H$ to be a normal subgroup of $G$, and so the generator $e$ induces an
automorphism $T : H \to H$ defined by the conjugation operation $Th := ehe^{-1}$. As $S_H$
generates $H$, we see that there exists some radius $R$ such that $T(S_H), T^{-1}(S_H) \subset
B_{S_H}(R)$. This gives rise to the crude bound

$\|T^n h\|_{S_H} \le R^{|n|} \|h\|_{S_H}$

for all $n \in \mathbf{Z}$ and $h \in H$, where the norms $\|\cdot\|_{S_H}$ were defined in Definition 1.1.

This bound is useless for our purposes because the argument gives no effective
bound on $R$. However, it turns out that one can use the polynomial growth hypothesis
$|B_S(R_0)| \le R_0^d$ to obtain a much stronger bound, after adjusting the generating
set $S$ slightly. Namely, we have

Proposition 12.1 (Slow growth). Let $R_0, d \ge 1$ be such that $R_0 \ge C^d$ for some
sufficiently large absolute constant $C$. Suppose we have a short exact sequence

$0 \to H \to G \to A$

of groups, where $A$ is cyclic. Suppose also that $G = (G,S)$ is a $(R_0,d)$-growth group
with $S = S_H \cup \{e, e^{-1}\}$, where $S_H \subset H$ generates $H$, and the projection of $e$ to
$A$ generates $A$. Then there exists a symmetric set $\tilde S \subset B_S(R_0^{1/10}) \cap H$ generating
$H$ such that

$\|T^n h\|_{\tilde S} \ll \exp(|n|/R_0^{0.01}) \|h\|_{\tilde S}$

for all $n \in \mathbf{Z}$ and $h \in H$, where $T : H \to H$ is the conjugation map $Th := ehe^{-1}$.
Proof. By hypothesis, $|B_S(R_0)| \le R_0^d$. From the pigeonhole principle and the lower
bound on $R_0$, we may then find a radius $R_0^{1/30} \le R \le R_0^{1/20}$ such that

$|B_S(10R) \cap H| \le O(1)^d |B_S(R) \cap H|$.

Fix this $R$. For every $0 \le N \le R$, consider the sets

$A_N := \bigcup_{-N \le n \le N} T^n(B_S(R) \cap H)$.

Observe that the $A_N$ are symmetric subsets of $H$ that are increasing in $N$, and
that $B_S(R) \cap H \subset A_N \subset A_N \cdot (B_S(R) \cap H) \subset B_S(10R) \cap H$, and thus

$|B_S(R) \cap H| \le |A_N \cdot (B_S(R) \cap H)| \le O(1)^d |B_S(R) \cap H|$.

By another application of the pigeonhole principle, there exists $0 \le N \le R/2$ such
that

$|A_{N + C^{-d}R} \cdot (B_S(R) \cap H)| < |A_N \cdot (B_S(R) \cap H)| + |B_S(R) \cap H|$

for some large absolute constant $C$.

Fix this $N$, and set $\tilde S := A_N$, thus $\tilde S$ is a symmetric subset of $B_S(R_0^{1/10}) \cap H$
which contains $S_H$ and thus generates $H$. For every $-C^{-d}R \le n \le C^{-d}R$, we see
from construction that

(65) $|(\tilde S \cup T^n \tilde S) \cdot (B_S(R) \cap H)| < |\tilde S \cdot (B_S(R) \cap H)| + |B_S(R) \cap H|$.

This implies that

$\tilde S \cup T^n \tilde S \subset \tilde S \cdot (B_S(R) \cap H) \cdot (B_S(R) \cap H)^{-1}$,

for if there was an element $x \in \tilde S \cup T^n \tilde S$ which did not lie in $\tilde S \cdot (B_S(R) \cap H) \cdot (B_S(R) \cap
H)^{-1}$, then the set $x \cdot (B_S(R) \cap H)$ would lie in $(\tilde S \cup T^n \tilde S) \cdot (B_S(R) \cap H)$ but be
disjoint from $\tilde S \cdot (B_S(R) \cap H)$, contradicting (65). Since $B_S(R) \cap H$ is contained in
the symmetric set $\tilde S$, we conclude that

$T^n \tilde S \subset B_{\tilde S}(3)$

and thus (as $T^n : H \to H$ is an automorphism)

$\|T^n h\|_{\tilde S} \le 3 \|h\|_{\tilde S}$

for all $h \in H$ and $-C^{-d}R \le n \le C^{-d}R$. From iteration we then obtain

$\|T^n h\|_{\tilde S} \ll \exp(C^d |n|/R) \|h\|_{\tilde S}$

for all $h \in H$ and $n \in \mathbf{Z}$, and the claim then follows from the lower bounds on $R$
and $R_0$. □
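
The final iteration in this proof is a short bookkeeping exercise: if every block of at most $n_0$ conjugations costs a factor of at most 3 in norm, then $|n|$ conjugations cost at most $3^{\lceil |n|/n_0 \rceil} \le 3\exp((\log 3)|n|/n_0)$. The sketch below checks this numerically; the function names are hypothetical, and $n_0$ plays the role of the block length $C^{-d}R$ from the proof.

```python
# Sketch: iterating a "factor 3 per block of n0 steps" bound gives an
# exponential envelope exp(O(|n| / n0)). Names are illustrative only.
import math


def iterated_bound(n, n0):
    """3**ceil(|n| / n0): the bound obtained by splitting n into blocks of size <= n0."""
    blocks = -(-abs(n) // n0)  # ceiling division on non-negative integers
    return 3 ** blocks


def exponential_envelope(n, n0):
    """The closed-form envelope 3 * exp(log(3) * |n| / n0)."""
    return 3 * math.exp(math.log(3) * abs(n) / n0)


for n in (0, 1, 10, 100):
    print(n, iterated_bound(n, 7), exponential_envelope(n, 7))
```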

13. Third step for Proposition 5.3: the case of an action on lattices

We are still preparing for the proof of Proposition 11.2 (and hence Proposition
5.3). Proposition 12.1 places us in the setting of an automorphism $T : H \to H$ on
some virtually nilpotent group $H$ whose iterates grow very slowly with respect to
some word norm $\|\cdot\|_{S'}$. In this section we study a key model case of this situation,
in which $H = \mathbf{Z}^D = (\mathbf{Z}^D, +)$ is a lattice of bounded dimension, and the word norm
$\|\cdot\|_{S'}$ is replaced by the Euclidean norm $|\cdot|$ on $\mathbf{R}^D$ (and hence $\mathbf{Z}^D$). The study
of this case is central to all arguments of Milnor-Wolf type, see e.g. [32], or the
appendix by Tits in [5]. The main result to establish in this section is as follows.
Proposition 13.1 (Dichotomy between periodicity and exponential growth). Let
$D \ge 1$, and let $T \in SL_D(\mathbf{Z})$ be an invertible linear transformation $T : \mathbf{Z}^D \to \mathbf{Z}^D$.
Then at least one of the following statements holds.

• (Periodicity) There exists a non-zero vector $w \in \mathbf{Z}^D$ and an integer $1 \le
n \ll D^{O(1)}$ such that $T^n w = w$.

• (Growth) For any $N \ge 1$, there exists a non-zero vector $v = v_N \in \mathbf{Z}^D$ such
that

(66) $|T^N v| \ge \exp(cN/D^{O(1)}) |v|$

for some absolute constant $c > 0$.
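
Both alternatives of the dichotomy already occur in dimension $D = 2$. As a minimal sketch (the two matrices are standard textbook examples chosen for this illustration, not taken from the paper), a rotation by a quarter turn is periodic with period 4, while the matrix with rows $(2,1)$ and $(1,1)$ stretches the vector $(1,0)$ exponentially fast; the growth rate $\exp(N/2)$ tested below is a convenient, non-optimal choice.

```python
# Sketch: the two alternatives of the dichotomy for 2x2 integer matrices of
# determinant 1. The example matrices are illustrative choices.
import math

ROTATION = ((0, -1), (1, 0))   # quarter turn: all eigenvalues are roots of unity
CAT_MAP = ((2, 1), (1, 1))     # hyperbolic: largest eigenvalue (3 + sqrt(5)) / 2 > 1


def apply(m, v):
    """Apply a 2x2 integer matrix to an integer vector."""
    return (m[0][0] * v[0] + m[0][1] * v[1], m[1][0] * v[0] + m[1][1] * v[1])


def iterate(m, v, n):
    """Compute m^n v by repeated application."""
    for _ in range(n):
        v = apply(m, v)
    return v


def norm(v):
    """Euclidean norm of a 2-vector."""
    return math.hypot(v[0], v[1])


print(iterate(ROTATION, (1, 0), 4))            # periodicity: back to the start
print([iterate(CAT_MAP, (1, 0), n) for n in range(1, 5)])   # growth
```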

Proof. Let $\lambda_1, \ldots, \lambda_D \in \mathbf{C}$ be the eigenvalues of $T$ (counting multiplicity). Then
$\lambda_1 \cdots \lambda_D = \det(T) = \pm 1$, which implies that $\max_{1 \le j \le D} |\lambda_j| \ge 1$.

Suppose first that $\max_{1 \le j \le D} |\lambda_j| = 1$, then all the $\lambda_j$ are algebraic integers whose
Galois conjugates all lie on the unit circle. By a classical result of Kronecker [10]
(or Remark 13.2 below), this implies that the $\lambda_j$ are all roots of unity. If one of
the $\lambda_j$ is a primitive $n$-th root of unity, then the degree $\phi(n)$ of that root cannot
exceed $D$. Elementary number theory (using the prime factorization of $n$) yields
that $\phi(n) \ge c_\varepsilon n^{1-\varepsilon}$ for any $\varepsilon > 0$ and some constant $c_\varepsilon > 0$, and thus we have
$n = O(D^{O(1)})$. Then $T^n - I$ has non-trivial kernel, and we obtain the periodicity
claim.

Now suppose instead that $\max_{1 \le j \le D} |\lambda_j| > 1$. Applying a result of Dobrowolski
[2], we conclude in fact that $\max_{1 \le j \le D} |\lambda_j| \ge 1 + cD^{-O(1)}$ for some $c > 0$ (in fact
the more precise bound $1 + \frac{c}{D} \left( \frac{\log\log D}{\log D} \right)^3$ is known). In particular we may assume
that $|\lambda_1| \ge 1 + cD^{-O(1)}$. Let $v_1$ be an eigenvector of $\lambda_1$, then clearly

$|T^N v_1| \gg \exp(cN/D^{O(1)}) |v_1|$.

From the triangle inequality we see that either the real or imaginary part of $v_1$ obeys
a similar growth bound. Approximating this real or imaginary part by a non-zero
vector with rational coefficients and then clearing denominators, we obtain the
claim. □
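
The periodicity case of the proof above rests on the fact that $\phi(n) \le D$ forces $n$ to be polynomially bounded in $D$. The sketch below makes this explicit by brute force. It relies on the elementary inequality $\phi(n) \ge \sqrt{n/2}$, which is a standard fact not stated in the text and is used here only to cap the search at $n \le 2D^2$; the function names are hypothetical.

```python
# Sketch: the largest possible order n of a root of unity whose degree phi(n)
# is at most D. The search range n <= 2 * D^2 uses phi(n) >= sqrt(n / 2).
from math import gcd


def phi(n):
    """Euler's totient function by direct count (adequate for small n)."""
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)


def largest_order(D):
    """Largest n with phi(n) <= D."""
    return max(n for n in range(1, 2 * D * D + 1) if phi(n) <= D)


for D in (1, 2, 4, 6):
    print(D, largest_order(D))
```

In particular the period $n$ in the periodicity alternative is at most $2D^2$ under this cruder bound, which is consistent with the claim $n = O(D^{O(1)})$.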

Remark 13.2. For our applications, one may replace Dobrowolski's lower bound of
$1 + cD^{-O(1)}$ here with the more elementary bound of $1 + \exp(-O(D))$, the proof of
which we sketch as follows. Suppose we had an algebraic integer $\lambda$ of degree $D$, all
of whose Galois conjugates were of magnitude at most $1 + C^{-D}$ for some large absolute constant
$C$, but which was not a root of unity. Using the minimal polynomial of $\lambda$, one can
find a diagonalisable transformation $T : \mathbf{C}^{D'} \to \mathbf{C}^{D'}$ for some $D' \le D$ with integer
coefficients and with eigenvalues equal to these Galois conjugates. One can then
use the corresponding eigenvectors to design a symmetric convex body $B$ with the
property that $T^n(B) \subset 2 \cdot B$ for all $n \le (C/10)^D$. By rescaling, one may assume
that $B$ contains a lattice vector $v$ in $\mathbf{Z}^{D'}$ on its boundary, but no non-zero lattice
vector in its interior. As $\lambda$ is not a root of unity, the images $T^n v$ are all distinct,
and so $2 \cdot B$ contains at least $(C/10)^D$ lattice vectors; but standard volume packing
arguments show that this cannot be the case for $C$ large enough.

14. Final step for Proposition 5.3: eliminating the finite factors

We are now ready to prove Proposition 11.2. Let $r, R_0, d, H, G, A, S, S_H, e$ be as
in Proposition 11.2.
By hypothesis, $H$ contains a finite index nilpotent subgroup $H'$ of Hirsch length
at most $r$, and thus step $s \le r$. As is well known, $H'$ then contains a finite index
subgroup $H''$ which is torsion free.

The group $H''$ need not be $T$-invariant. However, observe from Legendre's theorem
that $h^{|H:H''|} \in H''$ for all $h \in H$. Thus, if we let $H'''$ be the group generated
by $\{h^{|H:H''|} : h \in H\}$, then $H'''$ is a $T$-invariant normal subgroup of $H''$. The
nilpotent quotient group $H''/H'''$ is generated by finitely many torsion elements,
hence it is finite. We conclude that $H'''$ is a $T$-invariant finite index torsion free
subgroup of $H$, and thus has Hirsch length $r$. Let $H'''_i$ be an upper central series
for $H'''$ terminating at $H'''_s := H'''$. By torsion freeness, we can identify $H'''_i/H'''_{i-1}$
with $\mathbf{Z}^{d_i}$ for all $i \le s$, with $\sum d_i = r$.

Meanwhile, by Proposition 12.1 there exists a set $\tilde S \subset B_S(R_0^{1/10}) \cap H$ generating
$H$ such that

(67) $\|T^n h\|_{\tilde S} \ll \exp(|n|/R_0^{0.01}) \|h\|_{\tilde S}$

for all $n \in \mathbf{Z}$ and $h \in H$, where $T : H \to H$ is the conjugation map $Th := ehe^{-1}$.

We claim that the $\tilde S$ norm and $S'''$ norm are comparable on $H'''$, i.e. there exists
an $M > 0$ such that

(68) $M^{-1} \|h\|_{S'''} \le \|h\|_{\tilde S} \le M \|h\|_{S'''}$

for all $h \in H'''$. (We do not claim an effective bound on $M$.) The second inequality
follows for sufficiently large $M$ since each element of $S'''$ is generated by $\tilde S$. To get
the former inequality, observe as in Remark 1.4 that we can partition $H$ into finitely
many cosets $x_1 \cdot H''', \ldots, x_m \cdot H'''$ (with $x_1 = \mathrm{id}$, say) with relations $e x_i = x_{j_{e,i}} g_{e,i}$
for all $1 \le i \le m$, $e \in \tilde S$ and some $1 \le j_{e,i} \le m$, $g_{e,i} \in H'''$; iterating these relations
we see that any product of $R$ elements of $\tilde S$ can be expressed as the product of one
of the $x_i$ times $R$ of the $g_{e,i}$; in particular, any product of $R$ elements of $\tilde S$ that lies
in $H'''$ can be expressed as the product of $R$ of the $g_{e,i}$, giving the desired inequality
for some $M$.

Inserting (68) into (67) we conclude that

(69) $\|T^n h\|_{S'''} \ll M^2 \exp(|n|/R_0^{0.01}) \|h\|_{S'''}$

for all $h \in H'''$ and $n \in \mathbf{Z}$.

We shall now argue similarly to Tits in his appendix to Gromov's [5], but in a
quantitative manner. The automorphism $T$ preserves each $H'''_i$ and thus acts on the
successive abelian quotients, which are torsion free as well (cf. [18]): $H'''_i/H'''_{i-1} \cong
\mathbf{Z}^{d_i}$. Beginning from the top, by Proposition 13.1 either $H'''_s/H'''_{s-1} \cong \mathbf{Z}^{d_s}$ contains
a non-zero periodic vector with period at most $O(d_s)^{O(1)} = O(r)^{O(1)}$, or else one
can find for any $n \ge 1$ a non-zero vector $v$ in $\mathbf{Z}^{d_s}$ such that

$|T^n v| \ge \exp(n/O(r)^{O(1)}) |v|$.

But by the lower bound on $R_0$, the latter possibility contradicts (69) if one takes $n$
sufficiently large, since one can use the norm in $\mathbf{Z}^{d_s}$ as a lower bound for the $S'''$
norm. Thus $\mathbf{Z}^{d_s}$ contains a non-zero periodic vector $v$ with some period $1 \le p_0 \ll
r^{O(1)}$. This implies that the operator $T^{p_0} - I$ (viewed as a linear transformation
on $\mathbf{Z}^{d_s}$) maps $\mathbf{Z}^{d_s}$ to a $T$-invariant subgroup of $\mathbf{Z}^{d_s}$ of infinite index. Hence the
map $\Phi_{p_0} : h \mapsto (T^{p_0}h)h^{-1}$ maps $H'''_{(0)} := H'''_s/H'''_{s-1}$ into a $T$-invariant subgroup
of it, $H'''_{(1)}$ (of strictly smaller dimension). We can iterate this process, finding a
$1 \le p_1 \le O(r)^{O(1)}$ such that $\Phi_{p_1}$ maps $H'''_{(1)}$ into a $T$-invariant subgroup $H'''_{(2)}$ of
strictly smaller dimension, until these groups vanish, i.e., $T$ takes $H'''_s = H'''$ into
$H'''_{s-1}$. But then we can work on the abelian quotient $H'''_{s-1}/H'''_{s-2} \cong \mathbf{Z}^{d_{s-1}}$ instead.
Standard nilpotent algebra then tells us that the square root $|\cdot|^{1/2}$ of the norm on
$\mathbf{Z}^{d_{s-1}}$ is essentially a lower bound for the $S'''$ norm (continuing to rely heavily on
torsion freeness), and we can continue the argument much as before, reducing the
dimension of the $T$-invariant subgroups until they fall into $H'''_{s-2}$, then $H'''_{s-3}$, etc.,
eventually collapsing to the identity. Since $H'''$ had Hirsch length $r$, we can thus
find $1 \le p_1, \ldots, p_m \ll r^{O(1)}$ for some $1 \le m \le r$ and $T$-invariant subgroups

$H''' = H'''_{(1)} \ge H'''_{(2)} \ge \ldots \ge H'''_{(m+1)} = \{\mathrm{id}\}$

such that $\Phi_{p_i}$ maps $H'''_{(i)}$ to $H'''_{(i+1)}$ for all $1 \le i \le m$. If we let $P := p_1 \cdots p_m \ll r^{O(r)}$
be the product of all these periods, it follows that $T^P$ acts unipotently on $H'''$. This
implies that

$\{e^{nP} h : n \in \mathbf{Z}, h \in H'''\}$

is a finite-index subgroup of $G$ which is nilpotent of Hirsch length (and thus step)
at most $r + 1$, and Proposition 11.2 follows.

15. Effectivization 

In this section we discuss some fully effective results including Theorem 1.9 and
the case of torsion free groups, and mainly remark on the modifications needed in
the above arguments to make the bound on the quantity $K(R_0, d)$ in Theorem 1.8
effective. However, we will not provide complete details here for the latter, as they
are rather lengthy, and the final bound on $K(R_0, d)$ obtained by this process is
quite poor.

15.1. Some fully quantitative results. We begin by mentioning one quantitative
geometric application of Theorem 1.8, based on Milnor's result [21] as used in [5,
page 72].

Corollary 15.2 (to Theorem 1.8). Let $(V,d)$ be a complete Riemannian manifold
of dimension $n$, and $K \ge 0$ be such that the values of the Ricci tensor on the unit
tangent bundle at all points are bounded from below by $-(n-1)K$. Let $\Gamma$ be a group
of isometries of $V$ generated by the finite subset $S$. For a point $v \in V$ define:

6 V := mi{d(-fv, v)\-f e T} A v := sup{d(jv,v)\j € S} 

If for some v £ V and some R > exp(exp(C(2n) c ') the inequality: 

4^exp(2 7 rA t ,v / ^i?) < R 

holds, where C is the absolute constant of Theorem ] 1.81 then T is virtually nilpotent. 
This follows immediately from Milnor's inequality 21 

\B S (R)\ < 4"(^)"i?™exp(2^A„Vifi?) 

V 

combined with the main Theorem 11.81 to which the condition in Corollary 115.21 is 
tailored with the value d — 2n. Of course, the Corollary is of interest in situations 
where K, or both A V ,5 V and their quotient, are small. 
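
The way the hypothesis of Corollary 15.2 feeds into Theorem 1.8 can be sketched as a short worked computation. It assumes that Milnor's inequality has the form $|B_S(R)| \le 4^n (\Delta_v/\delta_v)^n R^n \exp(2n \Delta_v \sqrt{K} R)$ and that the hypothesis bounds the non-polynomial factor by $R^n$; both forms are reconstructions chosen to be consistent with the stated value $d = 2n$ and should be checked against [21].

```latex
\begin{align*}
|B_S(R)|
  &\le \Big[ 4^n \Big(\frac{\Delta_v}{\delta_v}\Big)^n
        \exp\big(2n \Delta_v \sqrt{K}\, R\big) \Big] \cdot R^n
     && \text{(Milnor's inequality, assumed form)} \\
  &< R^n \cdot R^n
     && \text{(hypothesis of the Corollary)} \\
  &= R^{2n} = R^d, \qquad d := 2n .
\end{align*}
```

Under these assumptions the ball of radius $R$ has at most $R^d$ elements with $d = 2n$, and the requirement $R > \exp(\exp(C(2n)^C))$ is the scale condition $R > \exp(\exp(C d^C))$ of Theorem 1.8 for this value of $d$.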



We next discuss Theorem 1.9. This result is in fact a direct consequence of
Proposition 5.2 above. More precisely, this Proposition immediately implies the
following:

Theorem 15.3. Let $d, R_0$ be as in Theorem 1.8. There exist effective (explicit)
functions $A(R_0,d)$, $B(d)$ such that for every $(R_0,d)$-growth group $(G,S)$, a sequence
of at most $B(d)$ operations of the types below reduces the group to the trivial group:

1. Passing to a finite index subgroup of index bounded by $A(R_0,d)$.

2. Passing to the kernel of a homomorphism to a cyclic group (finite or infinite).

The proof of this result goes simply by inspecting Proposition 5.2. In each
step where 1 occurs, the worst bound on $A(R_0,d)$ comes from possibility (i) in
Proposition 5.2, where $A(R_0, d)$ has to be taken as an upper bound for the size of

the ball ^(/j^* 1 ^ * 1 ^ ^ Otherwise, we pass to the finite index subgroup G' 

appearing in (ii), for which the inclusion $G \subset B_S(R_0) G'$ holds, giving a bound $R_0^d$
on its index. By Proposition 5.2, the number of operations $B(d)$ performed in this
process is obviously $O(d^{O(1)})$, thus completing the proof of Theorem 15.3.

To complete the proof of Theorem 1.9, notice that every group which can be
reduced to the trivial group using the operations 1 and 2 above is virtually
polycyclic in an effective manner, by applying the following lemma as many times as
necessary:

Lemma 15.4. If a group $G$ has a subgroup $G'$ of index at most $I$ which contains
a polycyclic normal subgroup of index at most $J$, then $G$ has a polycyclic normal
subgroup of index at most $(I \cdot J)!$.

Proof. By hypothesis, $G$ has a polycyclic subgroup $G''$ of index at most $IJ$. The
group $G$ acts by left multiplication on the quotient set $G/G''$; the kernel $H$ of
this action (the subgroup fixing every coset) then has index at most $(IJ)!$, since
$G/H$ embeds in the symmetric group on $G/G''$, and is a normal subgroup of $G$ that is
contained in $G''$. Since any subgroup of a polycyclic group is polycyclic, the claim
follows. □
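
The index bound in this proof can be checked by brute force on a small finite example. The sketch below is only an illustration, not part of the argument: it takes $G = S_4$ and a dihedral subgroup of order 8 (both choices, and all function names, are ours), computes the kernel of the action on cosets as the intersection of all conjugates, and compares its index with the factorial bound. Polycyclicity is automatic for finite solvable groups, so only the index statement is being illustrated.

```python
from math import factorial

def compose(p, q):
    """Return the permutation 'apply q first, then p' (tuples as maps i -> p[i])."""
    return tuple(p[i] for i in q)

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generate(gens):
    """Return the finite permutation group generated by gens (closure under products)."""
    identity = tuple(range(len(gens[0])))
    elements = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in elements:
                elements.add(h)
                frontier.append(h)
    return elements

def normal_core(G, H):
    """Intersection of all conjugates g H g^{-1}: the kernel of G acting on G/H."""
    core = set(H)
    for g in G:
        g_inv = inverse(g)
        core &= {compose(compose(g, h), g_inv) for h in H}
    return core

# G = S_4, generated by the transposition (0 1) and the 4-cycle (0 1 2 3).
G = generate([(1, 0, 2, 3), (1, 2, 3, 0)])
# H = dihedral group of order 8, generated by the 4-cycle and the reflection (0 2).
H = generate([(1, 2, 3, 0), (2, 1, 0, 3)])

core = normal_core(G, H)
index_of_H = len(G) // len(H)
index_of_core = len(G) // len(core)
print(index_of_H, index_of_core, factorial(index_of_H))
```

In this example the subgroup has index 3 and its core has index 6, so the bound $(IJ)!$ is attained here; in general it is only an upper bound.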

Remark 15.5. Of course, only type 1 steps contribute to the computation of the 
total index (this makes the computation effective; unfortunately, the lack of rea- 
sonable control over the size of the finite quotients appearing in 2 is responsible for 
our inability to get a similarly effective result concerning the index of a nilpotent 
subgroup). 

Finally, the following observation shows that torsion is the only obstacle at this
point to obtaining a fully effective version of Gromov's Theorem, an issue which
is discussed in more detail in the next subsection.

Corollary 15.6 (to Theorem 1.8). Retain the assumptions of Theorem 1.8, but
assume further that $G$ is torsion free. Then a finite index nilpotent subgroup as in
the Theorem can be found with index $q$ satisfying

$\log \log \log q < C^d$

where $C$ is an absolute (effective) constant.

This follows immediately from Corollary 1.12 and a result of Auslander and
Schenkman - see page 71] and the reference therein. 
15.7. Towards full effectivization of Theorem 1.8. For the purpose of full
quantification one must forego the compactness argument in Section 3 that allowed
us to reduce Theorem 1.8 to Theorem 3.2. Instead, one must replace "virtually
$r$-nilpotent" by something more like "$(K, R, s, D)$-virtually nilpotent" throughout
the arguments, making sure to keep the bounds on $K, R$ effective.

Of the three components of the proof in Section 5, the first two (Lemma 5.1
and Proposition 5.2) are already completely effective (and do not use the notion of
virtual $r$-nilpotency). The main issue, therefore, is to locate an effective version of
Proposition 5.3. This proposition is applied about $O(d)$ times during the induction
on the growth order $d$ in the proof of Theorem 1.8, so any deterioration in the
$K, R$ constants occurring in that proposition will need to be iterated $O(d)$ times
to obtain the final bound on $K(R_0,d)$. The arguments in Section 11, in which
Proposition 5.3 is reduced to Proposition 11.2, are also quite effective, in the sense
that any effective version of the latter can be converted by a routine modification
of the arguments in that section to a quantitative version of the former. But as
before, the deterioration of the bounds will worsen due to the induction on the step
I and the dimension $m$ appearing in the arguments in that section (more precisely,
the bounds in Proposition 5.3 will basically be a $O(1)^d$-fold iteration of the bounds
in Proposition 11.2).

The key proposition in Section 12, namely Proposition 12.1, is already effective.
The key proposition in Section 13, namely Proposition 13.1, can also be made
effective without much difficulty, for instance by using the Kronecker approximation
theorem to quantitatively approximate a vector by one with rational coefficients,
and using Cramer's rule to bound all the vectors that are constructed from linear
algebra (e.g. locating an integer vector in the null space of a matrix with integer
coefficients). To deal with various denominators in Cramer's rule, one does need to
obtain a quantitative lower bound on the difference $|\lambda - \lambda'|$ between two distinct eigenvalues
$\lambda, \lambda'$ of a matrix $T$ with integer coefficients, but this can be accomplished by using Galois
theory to observe that the product $\prod |\lambda - \lambda'|$ of all these eigenvalue gaps is
a non-zero rational integer, and in particular has magnitude at least 1. We omit
the details.
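
The $2 \times 2$ case already shows the mechanism and can be worked out by hand. The following is only an illustrative sketch: the notation $\lambda_1, \lambda_2$, $\operatorname{tr}$, $\det$ and the sample matrix are ours, and the final remark on the general case is a standard consequence rather than the authors' omitted argument.

```latex
% T is a 2x2 integer matrix with distinct eigenvalues \lambda_1, \lambda_2.
\begin{align*}
\lambda_1 + \lambda_2 &= \operatorname{tr} T \in \mathbf{Z}, \qquad
\lambda_1 \lambda_2 = \det T \in \mathbf{Z}, \\
\prod_{i \neq j} |\lambda_i - \lambda_j|
  &= |\lambda_1 - \lambda_2|^2
   = \big| (\lambda_1 + \lambda_2)^2 - 4 \lambda_1 \lambda_2 \big|
   = \big| (\operatorname{tr} T)^2 - 4 \det T \big| \in \mathbf{Z}_{\ge 1}, \\
\text{e.g. } T &= \begin{pmatrix} 2 & 1 \\ 1 & 1 \end{pmatrix}: \quad
  (\operatorname{tr} T)^2 - 4 \det T = 9 - 4 = 5, \qquad
  |\lambda_1 - \lambda_2| = \sqrt{5} \ge 1 .
\end{align*}
```

For larger matrices a single gap can be smaller than 1, but each gap is at most twice the largest eigenvalue magnitude, so a lower bound of 1 on the product of all gaps still yields an explicit lower bound on each individual gap in terms of the size of $T$.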

The only remaining parts of the argument which need more careful attention are
those in Section 14, as it is here that one truly begins to exploit such qualitative
notions as Hirsch length and torsion freeness.

Recall that a group $G = (G, S)$ is virtually $r$-nilpotent if it contains a finite index
nilpotent subgroup $G' = (G', S')$ which is torsion free of Hirsch length $r$ (and thus
step at most $r$). The notion of finite index can be made quantitative using the
concept of a $(K, R)$-subgroup as defined in Definition 1.3. Nilpotency of a given
step is also a quantitative concept (being nilpotent of step $s$ is equivalent to the
$s$-fold commutators of the generating set $S'$ vanishing). However, torsion freeness
cannot be verified using only finitely many group operations on the generators and
so should not be considered as quantitative. (For instance, the additive group
$\mathbf{Z}/N\mathbf{Z}$ with generating set $\{-1,+1\}$ has torsion, in contrast to the torsion-free
group $\mathbf{Z}$ with the same set of generators, but it is only possible to distinguish the
two groups after performing at least on the order of $\log N$ group operations on the generators,
which is unbounded as $N \to \infty$.) This is a genuine problem in the proof, as torsion
freeness is used crucially in Section 14 in order to bound $\|T^n h\|_{S'''}$ from below using
the norm of a projection of $T^n h$ to a free abelian group $\mathbf{Z}^{d_i}$.
It is thus natural to seek finitary substitutes for the concept of being torsion-free.
Call an abelian group $G = (G, +)$ generated by a set $S = \{\pm e_1, \ldots, \pm e_D\}$ of
generators $e_1, \ldots, e_D$ $M$-torsion-free for some $M \ge 1$ if the sums
$\{n_1 e_1 + \ldots + n_D e_D : |n_1| + \ldots + |n_D| \le M\}$ that make up $B_S(M)$ are all distinct. This property, in
contrast to the qualitative property of being torsion-free, can be verified in finite
time using a finite number of group operations on generators, and so we consider this
a quantitative property. A key ingredient in the proof that any finitely generated
nilpotent group has a finite index torsion free subgroup is the same result for
abelian groups. We now give a quantitative version of this fact. It is convenient
to extend the asymptotic notation $X \ll Y$ by allowing the constants to depend
on additional parameters, indicated by subscripts on the $\ll$ symbol; for instance,
$X \ll_D Y$ means that $X \le C(D) Y$ for some quantity $C(D)$ depending only on
$D$. (Of course, to get effective bounds at the end of the day, it is important to
ensure that all implied constants such as $C(D)$ depend in an effective fashion on
parameters such as $D$.)
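
Since $M$-torsion-freeness is a finite check, it can be tested directly by enumeration. The sketch below is a minimal illustration under our own modelling assumptions: the group is presented as a product of cyclic groups (a modulus of 0 stands for a copy of $\mathbf{Z}$), the generators are integer tuples, and the function names are hypothetical.

```python
def coefficient_vectors(D, M):
    """Yield all integer vectors (n_1, ..., n_D) with |n_1| + ... + |n_D| <= M."""
    if D == 0:
        yield ()
        return
    for n in range(-M, M + 1):
        for rest in coefficient_vectors(D - 1, M - abs(n)):
            yield (n,) + rest

def evaluate(coeffs, generators, moduli):
    """Compute n_1 e_1 + ... + n_D e_D in the product of cyclic groups."""
    total = [0] * len(moduli)
    for n, e in zip(coeffs, generators):
        for k, x in enumerate(e):
            total[k] += n * x
    # A modulus of 0 means the coordinate lives in Z and is not reduced.
    return tuple(t % m if m else t for t, m in zip(total, moduli))

def is_M_torsion_free(generators, moduli, M):
    """True if distinct coefficient vectors of l^1 norm <= M give distinct elements."""
    seen = set()
    for coeffs in coefficient_vectors(len(generators), M):
        value = evaluate(coeffs, generators, moduli)
        if value in seen:
            return False
        seen.add(value)
    return True

# Z/7Z with generators +-1 looks torsion-free up to M = 3 but not at M = 4.
print(is_M_torsion_free([(1,)], [7], 3))
print(is_M_torsion_free([(1,)], [7], 4))
# Z with generators +-1 passes the test for any M that is tried.
print(is_M_torsion_free([(1,)], [0], 10))
```

In this model $\mathbf{Z}/N\mathbf{Z}$ with generators $\pm 1$ passes the test exactly when $2M + 1 \le N$, which matches the remark above that torsion in $\mathbf{Z}/N\mathbf{Z}$ is invisible at scales small compared with $N$.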

Lemma 15.8 (Quantitative location of an $M$-torsion-free group). Let $G = (G, S, +)$
be a finitely generated abelian group with $D$ generators $S = \{\pm e_1, \ldots, \pm e_D\}$, and
let $F : \mathbf{R}^+ \to \mathbf{R}^+$ be an arbitrary function. Then there exists an integer
$1 \le M \ll_{F,D} 1$ and an $(M, 1)$-subgroup $(G',S')$ of $G$ with $S' = \{\pm f_1, \ldots, \pm f_{D'}\}$ for
some $1 \le D' \le D$, such that $(G',S')$ is $F(M)$-torsion-free.

Proof. (Sketch) We use the "rank reduction argument". We begin by initializing
$(G', S')$ equal to $(G, S)$ and initializing $M := 1$; then $(G', S')$ is already an
$(M, 1)$-subgroup of $(G, S)$. If $(G', S')$ is $F(M)$-torsion-free then we are done. If not, write
$S' = \{\pm f_1, \ldots, \pm f_{D'}\}$ for some $D'$ (which is initially equal to $D$). Because of the
failure of $(G', S')$ to be $F(M)$-torsion-free, we must have a non-trivial dependence

$n_1 f_1 + \ldots + n_{D'} f_{D'} = 0$

where $0 < |n_1| + \ldots + |n_{D'}| \le 2F(M)$. Without loss of generality we may take $n_{D'}$
to have the largest magnitude, and in particular be non-zero. Then we see that
the subgroup $(G'', S'')$ of $(G', S')$ generated by $S'' := \{\pm f_1, \ldots, \pm f_{D'-1}\}$ has finite
index (indeed, the index is at most $|n_{D'}|$); more quantitatively, one can show that
$(G'',S'')$ is a $(O(D'F(M)), 1)$-subgroup of $(G',S')$, which by Lemma O implies
that $(G'',S'')$ is a $(M', 1)$-subgroup of $(G, S)$ for some $M' = O(DMF(M))^{O(1)}$.
One then replaces $(G',S'), D', M$ by $(G'', S''), D'', M'$ respectively (with $D'' := D' - 1$), and iterates
this procedure. Since the rank $D'$ starts at $D$ and decreases by one at each stage,
this algorithm terminates in at most $D$ steps, and the claim follows. □
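
The rank reduction loop in this sketch of a proof can be written out as a small program. What follows is a hedged illustration, not the authors' construction: the group is modelled as a product of cyclic groups (modulus 0 for $\mathbf{Z}$), $F$ is assumed to be integer-valued, dependences are found by brute-force search, and the update $M' := D \cdot M \cdot F(M)$ is a hypothetical stand-in for the unspecified bound $O(DMF(M))^{O(1)}$.

```python
def coefficient_vectors(D, bound):
    """Yield all integer vectors of length D with l^1 norm at most bound."""
    if D == 0:
        yield ()
        return
    for n in range(-bound, bound + 1):
        for rest in coefficient_vectors(D - 1, bound - abs(n)):
            yield (n,) + rest

def evaluate(coeffs, generators, moduli):
    """Compute the sum of n_i f_i in a product of cyclic groups (modulus 0 = Z)."""
    total = [0] * len(moduli)
    for n, f in zip(coeffs, generators):
        for k, x in enumerate(f):
            total[k] += n * x
    return tuple(t % m if m else t for t, m in zip(total, moduli))

def find_dependence(generators, moduli, bound):
    """Return a non-zero n with l^1 norm <= bound and sum n_i f_i = 0, or None."""
    for coeffs in coefficient_vectors(len(generators), bound):
        if any(coeffs) and not any(evaluate(coeffs, generators, moduli)):
            return coeffs
    return None

def rank_reduction(generators, moduli, F):
    """Drop generators until no dependence of l^1 norm <= 2 F(M) remains.

    Returns the surviving generators and the final value of M.
    """
    D = len(generators)
    gens = list(generators)
    M = 1
    while gens:
        dependence = find_dependence(gens, moduli, 2 * F(M))
        if dependence is None:
            return gens, M  # the subgroup generated by gens is F(M)-torsion-free
        # Discard a generator whose coefficient has the largest magnitude.
        drop = max(range(len(gens)), key=lambda i: abs(dependence[i]))
        M = D * M * F(M)  # hypothetical stand-in for M' = O(D M F(M))^{O(1)}
        del gens[drop]
    return gens, M

# Z x Z/6Z with the two coordinate generators, and F(M) := M + 2.
F = lambda M: M + 2
result = rank_reduction([(1, 0), (0, 1)], [0, 6], F)
print(result)
```

In this example the search finds the dependence $6 f_2 = 0$, discards $f_2$, and stops with the subgroup generated by $f_1$, which has index 6 in the original group, in line with the index bound $|n_{D'}|$ from the proof.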

Remark 15.9. The eventual bound on M is essentially a D-fold iteration of F. 
Unfortunately, in applications F has to be quite a rapidly growing function (expo- 
nential or worse), which leads to quite poor bounds, especially after the induction 
loops on the growth order d, the solvability index I, and the dimension D that 
appear in the proof are carried out. It is thus of interest to reduce the dependence 
on the torsion-free property (and also to reduce the length of the induction loops) 
in order to improve the bounds. 
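
To get a feeling for how quickly a $D$-fold iteration of $F$ grows, one can iterate a model function numerically. This is a toy sketch only: the choice $F(M) = 2^M$ and the helper name are ours, and the actual recursion in the proof also involves the factors $D$ and $M$.

```python
def iterate(F, start, times):
    """Apply F to start the given number of times and return the result."""
    value = start
    for _ in range(times):
        value = F(value)
    return value

F = lambda M: 2 ** M  # a model "exponential" choice of F
towers = [iterate(F, 1, D) for D in range(5)]
print(towers)
```

Even for this mild choice of $F$ the values form a tower of exponentials in $D$, and one further iteration already gives a number with nearly twenty thousand decimal digits.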

In a similar vein to the above discussion, it is possible to define a quantitative
notion of an "$M$-torsion free nilpotent group", and establish a quantitative version
of the fact that every finitely generated nilpotent group contains a finite index
torsion free subgroup, which roughly speaking asserts that for any given $F$, any
$(s, D)$-nilpotent subgroup will have an $(M, M)$-subgroup which is $F(M)$-torsion
free for some $M \ll_{F,s,D} 1$. This will turn out to be a usable quantitative substitute
for the qualitative notion of torsion freeness if $F$ is chosen to be sufficiently rapidly
growing. Indeed, the various vectors in $\mathbf{Z}^D$ which appear in the arguments in
Section 14 will have norm bounded by some effective function of the parameter
$M$ (thanks to quantitative versions of results such as Proposition 13.1), and as long
as $F(M)$ is much larger than the norm of these vectors, the $M$-torsion free
nilpotent groups involved will behave "as if" they are genuinely torsion free for the
purposes of the computations being performed. For instance, it would suffice to
take $F(M) := \exp(\exp(\exp((CdDM)^C)))$ for some large absolute constant $C$. If one
does this, then the final bounds obtained on $K(R_0, d)$ are essentially an Ackermann
function of $R_0$ and $O(1)^d$.